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io 15 Background 

^ Cell death is a fundamental aspect of animal 

development. Many cells die during the normal develop- 
ment of both vertebrates (Glucksmann, Biol . Rev. 
Cambridge Philos . Soc 26:59-86 (1951)) and inverte- 

20 brates (Truman, Ann. Rev . Neurosci . 7:171-188 (1984)). 
These deaths -appear to function in morphogenesis, 
metamorphosis and tissue homeostasis, as well as in the 
generation of neuronal specificity and sexual 
dimorphism (reviewed by Ellis et al . , Ann. Rev. Cell 

25 Biol. 7:663-698 (1991))- An understanding of the 
mechanisms that cause cells to die and that specify 
which cells are to live and which cells are to die is 
essential for an understanding of animal development. 
The nematode Caenorhabditis elegans is an 

3 0 appropriate organism for analyzing naturally-occurring 
or programmed cell death (Horvitz et ai . , Neurosci. 
Comment. 1:56-65 (1982)). The generation of the 959 
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somatic cells of the adult C. elegans hermaphrodite is 
accompanied by the generation and subsequent deaths of 
an additional 131 cells (Sulston and Horvitz, Dev. 
Biol. 82:110-156 (1977); Sulston et al., Dev. Biol. - 
5 100:64-119 (1982)). The morphology of cells undergoing 
programmed cell death in C. elegans has been described 
at both the light and electron microscopic levels 
(Sulston and Horvitz, Dev. Biol. 82:100-156 (1977); 
Robertson and Thomson, J. Embryol. Exp. Morph. 67:89- 

10 100 (1982)). 

Many genes that affect e. elegans programmed cell 

death have been identified (reviewed by Ellis et al , 
jtan. Rev. Cell Biol. 7:663-698 (1991)). The activities 
of two of these genes, ced-3 and ced-4, are required 
15 for the onset of almost all C. elegans programmed cell 
deaths (Ellis and Horvitz, Cell 44:817-829 (1986)). 
When the activity of either ced-3 or ced-4 is . , 
eliminated, cells that would normally die instead 
survive and can differentiate into recognizable cell 
20 types and even function (Ellis and Horvitz, Cell 

,^817-829 (1986); Avery and Horvitz, Cell 51:1071-1078 
(1987); White at al., Phil. Trans. R. Soc. Lond. B. 
331:263-271 (1991)). Genetic mosaic analyses have 
indicated that the ced-3 and ced-4 genes most likely 
25 act in a cell autonomous manner within dying cells, 
suggesting that the products of these genes are 
expressed within dying cells and either are cytotoxic 
molecules or control the activities of cytotoxic 
molecules (Yuan and Horvitz, Dev. Biol. 138:33-41 
.30 (1990)). 

cii^arv o ^ ^ p invention 

• This invention relates to genes shown to be 
essential for programmed cell death, referred to herein 
as cell death genes, to their encoded products (RNA and 
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polypeptides) , and to antibodies directed against the 
encoded polypeptides . Methods and probes for 
identifying and screening for other cell death genes, 
including those of vertebrates as well as 
5 invertebrates, and possibly, microbes and plants, are 
described. Agents which mimic or affect the activity 
of cell death genes and methods for identifying these 
agents are also described. Bioassays which detect the 
activity of cell death genes and which are useful for 

10 identifying cell death genes, for testing the effect of 
mutations in cell death genes, and for identifying 
agents which mimic or affect the activity of cell death 
genes are also provided. This invention further 
relates to methods for altering (increasing or 

15 decreasing) the activity of the cell death genes or 

their encoded products in cells and, thus, for altering 
the proliferative capacity or longevity of a cell ; 
population or organism. 



2 0 nematode C. elegans have been identified, sequenced, 
and characterized. These genes have been shown to be 
required for almost all the programmed cell deaths 
which occur during development in C. elegans. Thus, 
two cell death genes and their encoded products (RNA, 

25 polypeptide) are now available for a variety of uses. 

As described herein, the ced-3 and ced-4 genes can 
be used to identify structurally related genes from a 
variety of sources. Some of these related genes are 
likely to also function as cell death genes. 

30 Structural comparison of related cell death genes, as 
well as mutational analysis, can provide insights into 
functionally important regions or features of cell 
death genes and gene products. This information is 
useful in the design of agents which mimic or which 

35 alter the activity of cell death genes. 



Specifically, the ced-3 and ced-4 genes of the 
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This invention further provides methods and agents 
for altering (increasing or decreasing) the occurrence 
of cell death in a cell population or organism. 
Methods and agents, described herein, which decrease . 
cell death are potentially useful for treatment 
(therapeutic and preventive) of disorders and 
conditions characterized by cell deaths, including 
myocardial infarction, stroke, traumatic brain injury, 

10 tllT ra t ±Ve diS£aSeS (e ' g " Kington's 

10 amyotrophic lateral sclerosis, Aider's disease, 

Parkinson's disease, and Duchenne's muscular 
dystrophy, , viral and other types of pathogenic 
infection (e.g., human immunodeficiency virus HIV) 
aging and hair loss. . Methods and agents which ' 
15 increase cell death are also provided and are 

potentially useful for reducing the proliferation or 
size of cell populations, such as cancerous cells 
cells infected with viruses (e.g. , HIV) or other ' 
infectious agents, cells which produce autoreactive 
20 antibodies and hair follicle cells. Such methods and 
agents may also be used to incapacitate or kill 
undesired organisms, such as pests, parasites, and 
recombinant organisms. 

Brief Description of the nr^n^ 
25 Figure l shows the genomic organization and 

nucleotide sequence (Seg. ID #i) of ced-4 and deduced 
ammo acid sequence (Seq. id #2) . The genomic sequence 
of the ced- 4 region was obtained from plasmid C10D8-5 
which rescues the ced-4 mutant phenotype. Two likely' 
3 0 transcriptional start sites are marked with downward 
arrows. The start of the cDNA is marked with a solid 
arrowhead. The positions of eight ced-4 mutations are 
indicated by upward arrows. Numbers on the sides 
indicate nucleotide positions, beginning at the start 
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of C10D8-5. Numbers under the amino acid sequence 
indicate codon positions. Vertical lines between 
nucleotides indicate splice junctions. 

Figure 2 shows the genomic structure of the ced-4 - 
gene and positions of ced-4 mutations. The sizes of 
exons and introns are indicated in base pairs (bp) . 
The downward arrows indicate the positions of the Tc4 
insertion in the ced-4 (nl416) mutant and of eight EMS- 
induced mutations of ced-4. The arrow pointing right 
indicates the direction of transcription. The solid 
arrowhead indicates the translation initiation site. 
The open arrowhead indicates the ochre termination 
codon. 

Figure 3 sho^s the sequence similarities between 



15 the Ced-4 protein 



The consensus sequence of the calcium-binding loop is 



shown at the top. 



and some calcium-binding proteins. 



The positions indicated by X, Y, Z t 



-X, and -Z correspond to vertices of an octahedron. 



The numbers above 
the positions of 
EF-hand sequence. 



the X, Y, Z, -X and -Z correspond to 
the residues within the 29 amino acid 
Amino acids are indicated by the 
single letter code. 0, amino acid with an oxygen- 
containing side chain. *, non-conserved amino acid. 
Positions Y, Z and -X can be any amino acid with 
oxygen-containincr side chains. Position X is usually 
aspartic acid, and position -Z is usually glutamic 
acid. Conserved amino acids are shown in bold-face. 
Deviations from the EF-hand consensus sequence are 



3 0 Figure 4 shows the nucleotide sequence (Seq. ID 

#18) of ced-3 and deduced amino acid sequence (Seq. ID 
#19) . The genomic sequence of the ced-3 region was 
obtained from plasmid pJ107, which rescues the ced-3 
mutant phenotype. The likely translation initiation 

3 5 site is indicated by a solid arrowhead. The SL1 splice 
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acceptor of the RNA is boxed. The positions of 12 
ced-3 mutations are indicated* Repetitive elements in 
the introns are indicated as arrows above the relevant 
sequence. Numbers on the sides indicate nucleotide 
positions, beginning with the start of pJl07. Numbers 
under the amino acid sequence indicate codon positions. 

Figure 5A shows the genomic structure of the ced-3 
gene and the location of the mutations. The sizes of 
the introns and exons are given in bp. The downward 
arrows indicate the positions of 12 EMS-induced 
mutations of ced-3 . The arrow pointing right indicates 
the direction of transcription. The solid arrowhead 
indicates the translation initiation site. The open 
arrowhead indicates the termination codon. 

Figure SB shows the locations of the mutations 
relative to the exons (numbered 1-8) and the encoded 
serine-rich region. 

Figure jS is a Kyte-Doolittle hydrophobicity plot 
of the Ced-3 protein. 



Figure 7 
of C. elegans 



shows a comparison of the Ced-3 proteins 
(line 1) and related nematodes, C. 
briggsae (lini 2) and C. vulgaris (line 3) . The 
conserved amino acids are indicated by " ." . Gaps 
inserted in the sequence for the purpose of alignment 
by 

Figure 8 shows a restriction site map of the ced-4 
region and the relative positions of plasmid C10D8-5, 
plasmid insert pnl416, and three transcripts encoded by 
the region. 

Figure 9 shows physical and genetic maps of the 
ced-3 region on chromosome IV. 

Figure 10 summarizes experiments to localize ced-3 
within C48D1. Restriction sites of plasmid C48D1 and 
subclone plasmids are shown, ced-3 activity was scored 
as the number of cell corpses in the head of LI young 
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animals. ++, the number of cell corpses above 10. + , 
the number of cell corpses below 10 but above 2. 
the number of cell corpses below 2. 

Detailed Description of t he Invention 

The ced-3 and ced-4 genes of C. elegans have been 
shown to be required for almost all programmed cell 
deaths in C. elegans development (Ellis and Horvitz, 
Cell 44:817-829 (1986)). The present work describes 
the cloning, sequencing and characterization of these 
genes. As a result of this work, two genes whose 
activities are required for cell death, referred to 
herein as cell death genes, and their encoded products 
(RNA, polypeptide) are available for a variety of uses. 
Described below are the cloning and characterization of 
the C. elegans ced-4 and ced-3 genes, methods and 
probes for identifying structurally related genes,, 
methods for identifying cell death genes from a variety 
of organisms', methods for identifying agents which 
mimic or which affect the activity of cell death genes, 
and methods and agents for altering cell death activity 
and thus, for altering the occurrence of cell death in 
a cell population or organism. 

The activity of a cell death gene is intended to 
include the activity of the gene itself and of the 
encoded products of the gene. Thus, agents and 
mutations which affect the activity of a gene include 
those which affect the expression as well as the 
function of the encoded RNA and protein. The agents 
may interact with the gene or with the RNA or protein 
encoded by the gene, or may exert their effect more 
indirectly. 
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The ced-4 Gene 



T ne r *<1-4 Gene ch aracterization of 

The cloning, sequencing and character! 

« described in Example 1. 
the c. vegans =ed-4 gene are descr 

Genomic clone, were obtained fro. a ced 
5 generated by transposon tagging. A sub 
L little as 4.4 Kb of vild-type genome 
to co.pl.nent the ced-4 mutant phenotype (see Table .1, 
tables are located at the end of the Detailed 

Description) . „ qH , 

A 2.2 kb mRNA was identified as the ced 4 
transcript. The transcript^ shown to. present 

r^scr^ri r:;:^ of T 

5 2 kb transcript was 
expression. -"^"^^Lg 
15 shown to be expressed primarily « 

Thi s is consistent with the observ t t t ^ 

Listen et a,, 
->n cDNA clones were rurtne^ «~ 

2 0 cDNA encoded product indicates 

fcnalvsis of the cDNA and its encoaeu y 

Analysis ox nrote in is 549 amino acids xn 

that the putative Ced-4 protein i 

i +h iTidure l; Seq. ID #2) and about 62,877 in 

length (Figure 1, q prot ein is highly 

relative molecular mass. The Ce * no • 

25 hydrophilic with a predicted pi ^^^^ 

obvi ous transmembrane re i n ^ Th ^ ^ 

region is a segment of 12 amino 



to 393. 



30 



35 



3 * , n f the ced-4 genomic clone and 

se^ence analysis of t- c ^ ^ 

comparison with the cDNA J ^ ^ 

ced-4 gene contains / i"""' 
44 bp to 557 bp (Figure 2). 

The nucleotide sequences of eight EMS-induced 
ced -/mutations were also determined, of the e>ght 
stations, one results in a single amino acid 
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substitution and the other seven appear to prevent 
either ced-4 RNA splicing or completion of Ced-4 
protein synthesis (Figure 2 and Table 2). These seven 
mutations establish the null phenotype of the ced-4 



for viability. 

Two regions of the inferred Ced-4 protein have 
sequence similarity to known calcium-binding domains 
(Kretsinger, Cold Spring Harbor Symp. Quant. Biol. 

10 52:499-510 (1987)), suggesting that Ced-4 activity and 
hence, programmed cell death may be modulated by 
calcium (see Figure 3 and Example 1) . Calcium has been 
implicated as an essential mediator of cell death in 
other organisms under a variety of conditions . For 

15 example, extracellular calcium is required for 

glucocorticoid-induced thymocyte death (Cohen and Duke, 
J. Immunol. 132:38-42 (1984)), for the deaths of adult 
rat hepatocytes induced by certain toxins in vitro 
(Schanne et al., Science 205:700-702 (1979)), for 

2 0 agonist-induced muscle degeneration in mice (Leonard 

and Salpeter, J. Cell Biol. 82:811-819 (1979)) and for 
neuronal cell death caused by oxygen deprivation or 
excitotoxicity (Coyle et al., Neurosci. Res. Prog. 
Bull. 15:331-427 (1981); Choi, J. Neurosci. 7:369-379 
25 (1987), Choi, Trends Neurosci. 11:465-469 (1988)). It 
is possible that programmed cell death is initiated 
during C. elegans development by an increase in 
intracellular calcium, which activates the Ced-4 
protein to become cytotoxic. On the other hand, 

3 0 certain cells seem to be protected against cell death 

by calcium (e.g., Koike et al., Proc. Natl. Acad. Sci. 
USA 85:6421-6425 (1989); Collins et al . , J. Neurosci. 
11:2582-2587 (1991)), suggesting that increases in 
intracellular calcium levels may inhibit the activity 



5 gene, confirming that ced-4 function is not essential 
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of the Ced-4 protein and thereby prevent programmed 
cell death. 

The level of the ced-4 transcript in eggs is about 
20% that of the actin 1 transcript, which is relatively 
abundant (Edwards and Wood, Dev. Biol. 97:375-390 
(1983)). This level seems higher than might be 
expected if ced-4 were expressed only in dying cells, 
since in an embryo there are usually no more than two 
or three cells dying at the same time. These 
considerations suggest that ced-4 might be transcribed 
not only in dying cells but in other cells as well. 
Perhaps Ced-4 activity, at least during embryonic 
development, is regulated at a post-transcriptional 
level. For example, the Ced-4 protein might have to 
interact with other proteins or other factors (such as 
calcium) to cause cell death. Since the ced-3 gene is 
also essential for programmed cell death in C. elegans, 
one possibility is that the activity of the Ced-4 
protein is dependent upon ced-3 function. 



20 



ThP ged-3 Gene 

The cloning, sequencing and characterization of 
the ced-3 gene are described in Example 2. The ced-3 
gene was cloned by mapping DNA restriction fragment 

25 length polymorphisms (RFLPs) and chromosome walking. 1 
7.5 kb fragment of genomic DNA was shown to complement 
ced-3 mutant phenotypes. A 2.8 kb transcript was 
further identified. The ced-3 transcript was found to 
be most abundant in embryos, but was also detected m 

3 0 larvae and young adults, suggesting that ced-3 is not 
only expressed in cells undergoing programmed cell 
deatn. 

A 2.5 kb cDNA corresponding to the ced-3 mRNA was 
sequenced. The genomic sequence was also determined 
35 (Figure 4; Seq. ID #18) and a comparison with the cDNA 
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sequence revealed that the ced-3 gene has 8 introns 
which range in size from 54 to 1195 bp (Figure 5A) • 
The four largest introns as well as sequences 5' of the 
start codon contain repetitive elements, some of which 
5 have been previously characterized in non-coding 

regions of other C. elegans genes such as fem-1 (Spence 
et al., Cell 50:981-990 (1990)), lin-12 (J. Yochem, 
personnal communication), and myoD (Krause et al., Cell 
53:907-919 (1990) )• The transcriptional start site was 
10 also mapped, and the ced-3 transcript was found to be 
trans-spliced to a C. elegans splice leader, SL1. 



mutations, two are nonsense mutations, and two are 
15 putative splicing mutations (Table 3) . The molecular 
nature of these mutations, together with results of 
genetic and developmental analyses of nematodes 
homozygous for these mutations, indicate that, like 
ced-4, ced-3 function is not essential to viability . 
20 In addition, 10 out of the 12 mutations are clustered 
in the C-terminal region of the gene (Figure SB) , 
suggesting that this portion of the encoded protein may 
be important for activity. 



25 amino acids (Figure 4; Seq. ID #19). The protein is 
very hydrophilic and no significantly hydrophobic 
region can be found that might be a transmembrane 
domain (Figure 6) . One region of the ced-3 protein is 
very rich in serine. Sequence comparison of two 

30 additional ced-3 genes from related nematodes, C. 
briggsae and C. vulgaris , suggests that the exact 
sequence in this serine-rich region may not be 
important but that the serine-rich feature is (Figure 
7; Seq. ID #19-21). This hypothesis is supported by 

35 the analysis of ced-3 mutations: none of 12 EMS-induced 



Twelve EMS-induced ced-3 alleles were also 
sequenced. Eight of the mutations are missense 



The ced-3 gene encodes a putative protein of 503 
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ced-3 mutations is in the serine-rich region (Figure 
5B) . 

The conservation of the serine-rich feature among 
the ced-3 genes of different nematodes suggests that 
5 the serine-rich region may act in semi-specific 

protein-protein interactions, similar to acid blobs in 
transcription factors and basic residues in nuclear 
localization signals. In all these cases, the exact 
primary sequence is not important. 

10 It is possible that the serine residues in the 

Ced-3 and Ced-4 proteins may be targets for a Ser/Thr 
kinase, and that the activity of these proteins may be 
regulated post-translationally by protein 
phosphorylation. McConkey et al. (J. Immunol., 

15 145:1227-1230 (1990)) have shown that phorbol esters, 
which stimulate protein kinase C, can block the death 
of cultured thymocytes induced by exposure to Ca ++ 
ionophores or glucocorticoids (Wyllie, Nature 284: 555- 
556 (1980); Wyllie et al., J. Path. 142:67-77 (1984)). 

20 It is possible that protein kinase c may inactivate 
certain cell death proteins by phosphorylation, and 
thus, inhibit cell death and promote cell 
proliferation. Several agents that can elevate 
cytosolic cAftP levels have been shown to induce 

25 thymocyte death, suggesting that protein kinase A may 
also play a role in mediating thymocyte death. Furthe: 
evidence suggests that abnormal phosphorylation may 
play a role in the pathogenesis of certain cell- 
degenerative diseases. For example, abnormal 

3 0 phosphorylation of the microtubule-associated protein 
Tau is found in the brains of Alzheimer's disease and 
Down's syndrome patients (Grundke-Iqbal et al., Proc. 
Natl, Acad. Sci. USA 83:4913-4917 (1986); Flament et 
al., Brain Res. 516:15-19 (1990)). Thus, it is 

3 5 possible that phosphorylation may have a role in 
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regulating programmed cell death in C. elegans. This 
is consistent with the fairly high levels of ced-3 and 
ced-4 transcripts which suggest that transcription 
regulation alone may be insufficient to regulate 
programmed cell death. 

Structurally and Functionally Related Genes 

As a result of the work described herein f it is 
possible to identify genes which are structurally 
and/or functionally related to ced-3 or ced-4 . Such 
genes are expected to be found in a variety of 
organisms, including vertebrates (e.g., mammals and 
particularly humans), invertebrates (e.g., insects), 
microbes (e.g., yeast) and possibly plants. 
Structurally related genes refer herein to genes which 
have some structural similarity to the nucleotide 
sequences (genomic or cDNA) of one or both of the ced-3 
or ced-4 genes, or whose encoded proteins have some 
similarity to one or both of the amino acid sequences 
of the Ced-3 or Ced-4 proteins. Functionally related 
genes refer to genes which have similar activity to 
that of ced-3 and ced-4 in that they cause cell death. 
Such genes can be identified by their ability to 
complement ced-3 or ced-4 mutations in bioassays, as 
described below. 

Previous studies are consistent with the 
hypothesis that genes similar to the C. elegans ced-3 
and ced-4 genes may be involved in the cell deaths that 
occur in both vertebrates and invertebrates. Some 
vertebrate cell deaths share certain characteristics 
with the programmed cell deaths in C. elegans that are 
controlled by ced-3 and ced-4. For example, up to 14% 
of the. neurons in the chick dorsal root ganglia die 
immediately after their births, before any signs of 
differentiation (Carr and Simpson, Dev. Brain Res. 




-14- 



2:57-162 (1982)). Genes like ced-3 and ced-4 could 
well function in this class of vertebrate cell death. 
In addition, genes related to ced-3 and ced-4 could 
function in many other types of vertebrate cell death 
5 processes, including those involving cells that die 
long after their births and those that die as a result 
of stress (e.g., oxygen deprivation) or disease. 

Genetic mosaic analysis has suggested that the 
ced-3 and ced-4 genes act within cells that undergo 

10 programmed cell death, rather than through cell-cell 
interactions or diffusible factors (Yuan and Horvitz, 
Dev. Biol. 138:33-41 (1990)). Many cell deaths in 
vertebrates seem different in that they appear to be 
controlled, by interactions with target tissues. For 

15 example, it is thought that a deprivation of target- 
derived growth factors is responsible for vertebrate 
neuronal cell deaths (Hamburger and Oppenheim, 
Neurosci. Comment. 1:39-55 (1982)); Thoenen et al . , in: 
Selective Neuronal Death, Wiley, New York, 1987, Vol. 

20 126, pp. 82-85). However, even this class of cell 

death could involve genes like ced-3 and ced-4, since 
pathways of cell death involving similar genes and 
mechanisms might be triggered in a variety of ways. 
Supporting this idea are several in vitro and in vivo 

25 studies which show that the deaths of vertebrate as 
well as invertebrate cells can be prevented by 
inhibitors of RNA and protein synthesis, suggesting 
that activation of genes is required for these cell 
deaths (Martin et al., J. Cell Biol. 105:829-844 

30 (1988); Cohen and Duke, J. Immunol. 132:38-42 (1984); 
Oppenheim and Prevette, Neurosci. Abstr. 14:368 (1988); 
Stanisic et al . , Invest. Urol. 15:19-22 (1978); 
Oppenheim et al . , Dev. Biol. 138:104-113 (1990); 
Fahrbach and Truman, in: Selective Neuronal Death, Ciba 

35 Foundation Symposium, 1987, No. 126, pp. 65-81). It is 
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possible that the genes induced in these dying 
vertebrate and invertebrate cells are cell death genes- 
similar to the C. elegans genes ced-3 and ced-4. 

Also supporting the hypothesis that cell death in 
5 C. elegans is mechanistically similar to cell death in 
vertebrates is the observation that the protein product 
of the C. elegans gene ced-9 is similar in sequence to 
the human protein Bcl-2. ced-9 has been shown to 
prevent cells from undergoing programmed cell death 

10 during nematode development by antagonizing the 
activities of ced-3 and ced-4 (Hengartner, et al . , 
Nature 356:494-499 (1992)). The jbcl-2 gene has also 
been implicated in protecting cells against cell death. 
It seems likely that the genes and proteins with which 

15 ced-9 and jbcl-2 interact are similar as well. 

Genes which are structurally related to ced-3 or 
ced-4 are likely to also act as cell death genes, i 
Structurally, related genes can be identified by any 
number of detection methods which utilize a defined 

20 nucleotide or amino acid sequence or antibodies as 
probes. .For example, nucleic acid (DNA or RNA) con- 
taining all or part of the ced-3 or ced-4 gene can be 
used as hybridization probes or as polymerase chain 
reaction (PCR) primers. Degenerate oligonucleotides 

25 derived from the amino acid sequence of the Ced-3 or 
Ced-4 proteins can also be used. Nucleic acid probes 
can also be based on the consensus sequences of 
conserved regions of genes or their protein products. 
In addition, antibodies, both polyclonal and 

3 0 monoclonal, can be raised against the Ced-3 and/or Ced- 
4 proteins and used as immuhoprobes to screen 
expression libraries of genes. 

One strategy for detecting structurally related 
genes in other organisms is to initially probe animals 

3 5 which are taxonomically closely related to the source 
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of the probes, for example, probing other worms with a 



are detected with the probe than more distantly related- 
5 organisms. Sequences conserved between ced-3 or ced-4 
and these new genes can then be used to identify 
similar genes from less closely related species. 
Furthermore, these new genes provide additional 
sequences with which to probe the molecules of other 

10 animals, some of which may share conserved regions with 
the new genes or gene products but not with ced-3 , 
ced-4, or their gene products. This strategy of using 
structurally related genes in taxonomically closer 
organisms as stepping stones to genes in more distantly 

15 related organisms can be referred to as walking along 
the taxonomic tree. 

Groups of structurally related genes, such asi 
those obtained by using the above-described strategy, 
can be referred to as gene families. Comparison of 

20 members within a gene family, or their encoded 

products may indicate functionally important features 
of the genes or their gene products. Those features 
which are conserved are likely to be significant for 
activity. Such conserved sequences can then be used 

25 both to identify new members of the gene family and in 
drug design and screening. For example, as described 
in Example 2, genes similar to ced-3 from two other 
species of nematodes (C. briggsae and C. vulgaris) were 
identified and characterized. Serine-rich regions were 

30 found in the polypeptides encoded by all three genes. 
Although the sequence of the serine-rich region was not 
well conserved, the number of serines was conserved, 
suggesting that the serine-rich feature, but not the 
exact sequence of the serine-rich region, is 

35 significant for function. 



ced-3 or ced-4 probe. Closely related species are more 
likely to possess related genes or gene products which 




Functionally important regions can also be 
identified by mutagenesis. For example, inactivating 
mutations of ced-3 were found to cluster within a 
region near the COOH-terminus (Figure SB) , suggesting 
that this region is a functionally important domain of 
the Ced-3 protein. Further mutational analyses can be 
carried out on the ced-J and ced-4 genes; mutants with 
novel properties, as well as other regions important 
for activity, may be discovered. Mutations and other 
alterations can be accomplished using known methods, 
such as in vivo and in vitro mutagenesis (see, e.g., 
Ausubel et al . (eds.), Current Protocols in Molecular 
Biology, Greene Publishing Associates and Wiley- 
Interscience, New York) • 

Bioassays and Agents Which Affect the Activity of Cell 
Death Genes 

This invention further provides bioassays which 
detect the activity of cell death genes. The bioassays 
can be used to identify novel cell death genes, to 
identify mutations which affect the activity of cell 
death genes, to identify genes which are functionally 
related to known cell death genes, such as ced-3 or 
ced-4 , to identify genes which interact with cell death 
genes, and to identify agents which mimic or affect the' 
activity of cell death genes (e.g., agonists and 
antagonists) . For example, the bioassays can be used 
to screen expression gene libraries for cell death 
genes from other organisms. 

In this bioassay, genes or agents are introduced 
into nematodes to test their effect on cell deaths in 
vivo. Wild-type, mutant, and transgenic nematodes can 
be used as appropriate for the effect being tested. In 
one embodiment of this bioassay, transgenic nematodes 
are produced using a candidate cell death gene, a 





-18- 



mutant cell death gene, or genes from an expression 
library, to observe the effect of the transgene on the 
pattern of programmed cell deaths during development of 
the nematode . For example, a gene which is 
5 structurally related to ced-3 can be used to produce a 
transgenic animal from a mutant nematode which 
underexpresses or expresses an inactivated ced-3 gene 
to see if the related gene can complement the ced-3 
mutation and is thus, functionally as well as 

10 structurally related to ced-3. cDNA or genomic 

libraries can be screened for genes having cell death 
activity. Genes which interact with cell death genes 
to enhance or suppress their activity can also be 
identified- by this method. 

15 In another embodiment of the bioassay, wild-type, 

mutant, or transgenic nematodes are exposed to or 
administered peptides and other molecules in order -»to 
identify agents that mimic, increase, or decrease the 
activity of a cell death gene. For example, wilg-type 

2 0 animals can be used to test agents that inactivate or 

antagonize the activity of ced-3 or ced-4 and hence, 
decrease cell deaths, or that activate or enhance ced-3 
or ced-4 activity and increase cell deaths. Mutant 
animals in which ced-3 or ced-4 is inactivated can be 
25 used to identify agents or genes which mimic ced-3 or 
ced-4 in causing cell deaths. Mutant animals in which 
ced-3 or ced-4 is overexpressed or constitutively 
activated can similarly be used to identify agents that 
prevent ced-3 or ced-4 from causing cell death. 

3 0 Transgenic animals in which a wild-type or mutant form 

of an exogenous cell death gene causes excess cell 
deathsi due to overexpression or hyperactivity can be 
used to identify agents that inactivate or inhibit the 
activity of the transgene. Similarly, transgenic 
3 5 animals in which a wild-type or mutant form of an 
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exogenous cell death gene is underexpressed or inactive 
can be used to identify agents that activate or 
increase its activity. Test molecules can be 
introduced into nematodes by microinjection, diffusion, - 
ingestion, shooting with a particle gun, or other 
method . 

Mutated cell death genes with novel properties may 
be identified by the above bioassay. For example, 
constitutively activated or hyperactive cell death 
genes may be isolated which may be useful as agents to 
increase cell deaths. Mutations may also produce genes 
which do not cause cell death but which antagonize the 
activity of the wild-type gene. 

Agents can be obtained from traditional sources, 
15 such as extracts (e.g., bacterial, fungal or plant) and 
compound libraries, or by newer methods of rationale 
drug design. Information on functionally important 
regions of the genes or gene products, gained by 
sequence and/or mutational analysis, as described 
above, may provide a basis for drug design. The 
activity. of the agents can be verified both by in vivo 
bioassays using nematodes which express various forms 
of ced-3, ced-4, or related genes, as described above, 
and by in vitro systems, in which the genes are 
25 expressed in cultured cells, or in which isolated or 
synthetic gene products are tested directly in 
biochemical experiments. The agents may include all or 
portions of the ced-3, ced-4, or related genes, mutated 
genes, and all or portions of the gene products (RNA, 
including antisense RNA, and protein) , as well as 
nucleic acid or protein derivatives, such as 
oligonucelotides and peptides, peptide and non-peptide 
mimetics, and agonists and antagonists which affect the 
activity or expression of the cell death genes. The 
35 agents can also be portions or derivatives of genes or 
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gene products which are not cell death genes but which 
regulate the expression of, interact with, or otherwise 
affect the function of cell death genes or gene 
products. 

5 Uses of the Invention 

Using the above-described probes and bioassays, 
the identification and expression of ced-3 , ced-4 or 
related cell death genes in cultured cells, tissues, 
and whole organisms can be studied to gain insights 

10 into their role in development and pathology in various 
organisms* For example, the detection of abnormalities 
in the sequence, expression, or activity of a cell 
death gene or gene product may provide a useful 
diagnostic for diseases involving cell deaths, 

15 This invention further provides means of altering 

or controlling the activity of a cell death gene in a 
cell, and, thus, affecting the occurrence of cell 
death. Activity of the cell death gene can be altered 
to either increase or decrease cell deaths in a 

2 0 populatidn of cells and, thus, affect the proliferative 

capacity or longevity of a cell population, organ, or 
entire organism. 

Agents which act as inactivators or antagonists of 
the activity of ced-3, ced-4, or other cell death genes 
25 can be used to prevent or decrease cell deaths. Such 
agents are useful for treating (i.e., for both 
preventive and therapeutic purposes) disorders and 
conditions characterized by cell deaths, including 
neural and muscular degenerative diseases, stroke, 

3 0 traumatic brain injury, myocardial infarction, viral 

(e.g., HIV) and other types of pathogenic infections, 
as well as cell death associated with normal aging and 
hair loss. The agent can be delivered to the affected 
cells by various methods appropriate for the cells or 
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organs being treated, including gene therapy. For 
example, anti-sense RNA encoded by all or a part of a : 
cell death gene which is complementary to the mRNA can 
be delivered to a population of cells by an appropriate - 
5 vector, such as a retroviral or adenoviral vector, or 
an antagonist of cell death activity can be infused 
into a wound area to limit tissue damage. 

Methods and agents which cause or increase cell 
deaths are also useful, for example, for treating 
10 disorders characterized by an abnormally low rate or 

number of cell deaths or by excessive cell growth, such 
as neoplastic and other cancerous growth. Such methods 
and agents are also useful for controlling or 
eliminating cell populations, such as cells infected 
15 with viruses (e.g., HIV) or other infectious agents, 
cells producing autoreactive antibodies, and hair 
follicle cells. In addition, methods and agents which 
increase cell death can be used to kill or incapacitate 
undesired organisms, such as pests, parasites and 

2 0 genetically engineered organisms. All or portions of 

ced-3, ce.d-4, or related cell death genes, active 
mutant genes, their encoded products, agents which 
mimic the activity of cell death genes, and activators 
and agonists' of cell death genes can be used for this 
25 purpose. 

For example, cell death genes can be used to kill 
cells infected with the human immunodeficiency virus 
(HIV), and thus, prevent or limit HIV infection in an 
individual. A recombinant gene can be constructed, in 

3 0 which a cell death gene is under the control of a viral 

promoter which is specifically activated by a viral 
protein; the recombinant gene is introduced into HIV 
infected cells. HIV-infected cells containing the 
viral activator protein would express the cell death 
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gene product and be killed, and uninfected cells would 
be unaffected. 

Alternatively, an antagonist of ced-3 or ced-4 
activity (such as antisense RNA) can be expressed under 
the control of a viral-specific promoter and in this 
way, be used to prevent the cell death associated with 
viral (e.g., HIV) infection. 

In another example, cell death genes can be used 
as suicide genes for biological containment purposes. 
Genetic engineering of suicide genes into recombinant 
organisms has been reported in bacteria (Genetic 
Engineering News, Nov. 1991, p. 13): suicide genes 
were engineered to be expressed simultaneously with the 
desired recombinant gene product so that the 
recombinant bacteria die upon completion of their task. 
The present invention provides suicide genes which are 
useful in a variety of organisms in addition to 
bacteria, for example in insects, fungi, and transgenic 
rodents. Suicide genes can be constructed by placing 
the coding sequence of an exogenous cell death gene or 
an agonist of an endogenous cell death gene of the 
organism in an expression vector suitable for the 
organism. 

In addition, agents which increase cell death are 
25 useful as pesticides (e.g., anthelminthics, 

nematicides) . For example, many nematodes are human, 
animal, or plant parasites, ced-3, ced-4, or other 
nematode cell death genes, their gene products, 
mimetics, and agonists can be used to reduce the 
nematode population in an area, as well as to treat 
individuals already infected with the parasite or 
protect individuals from infection. A transgenic plant 
or animal carrying a constitutively activated ced-3 
gene, ced-4 gene, or other cell death gene specific to 
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nematodes can be protected from nematode infection in 
this way. 

The subject invention will now be illustrated by 
the following examples, which are not intended to be 
5 limiting in any way. 2 

EXAMPLE 1 

CLONING. SEQU ENCING AND CHARACTERIZATION OF 
THE CEP- 4 GENE 

MATERIALS AND METHODS 

10 General Methods and Strains 

Techniques used for the culturing of C. elegans 
were essentially as described by Brenner (Genetics 
77:71-94 (1974)). All strains were grown at 20°C. DNA 
was prepared from worms grown on Petri dishes , 
15 containing agarose seeded with E. coli strain HB101. 
RNA was prepared from mass cultures grown in liquid. 
Usually, the bacterial pellet from a 2 L overnight 
culture of E . coli HB101 grown in super broth (12 g 
Bacto-tryptone, 24 g yeast extract, 8 ml 50% glycerol, 
20 900 ml H 2 0; after autoclaving, 100 ml 0.17 M KH 2 HP0 4 

and 0.72 K 2 HP0 4 were added) was resuspended in 500 ml S 
basal medium (Brenner, 1974 supra) , and worms were 
added from one or two 10 cm Petri dishes in which the 
bacterial lawns had just been consumed. Worms were 
25 harvested about 4-5 days later by centrif ugation and 
washed in M9 buffer (Brenner, 1974 supra). The yield 
was about 5-10 ml of packed worms. 

Nomarski differential interference contrast 
microscopy was used to examine individual cells in 
3 0 living nematodes (Sulston and Horvitz, Dev. Biol. 
82:110-156 (1977)). Methods for scoring the Ced 
phenotype of ced-1, ced-4 and ced-2; ced-4 double 
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»uta n ts have been described fay £llis 
138:33-41 (1990)). 

5 tw* 8 W " d - type «* «U "utant strains used in . 

« BrZ e " aentS Sje?MS V " iet ^ B ^"tol strain 

listed below. These Mr]cers have ^ described 

Caen 0 rhabd itis elecans. Wood and the Coazmnity of C. 
10 elecans Researchers (eds.,, Cold Spring Harbor 

Laboratory, Hew York, 1988, pp. 491-584; Fi „ ne v et ai 

aV s:75 r 769 (i9s8,) - The strai - »»» t- 

15 S „ ^ alISleS •*'», 
*2347. and «m„ were characterized in the present 

for c eT tiC " 0menCl " U " foll °<« the standard system 
for c. elegans (Horvitz et ai., Afcl . Gen. Genet 
"5:129-133 ^1979)): ' ! 

LG I: cea-i ( ei735), unc~54(r323) 

L= in, uac-s 5^2352;, C ed-^ n i I52/ nl416f 

"1S94, D1920, nl947, B194S, R2247, 
U2273, B1416 nl712, nl416 B1713),' 
mc-7S(el0SS), dpy-i7(el64) 
LG IV: unc-31(s923), ce<3-3(n717) 
25 1/1 V: egl-i ( niB6), unc-7S(e911) 

Genomir r,jh ra rica 

A 4-6 kb size-selected phage library was 
constructed from c«-4 (Dl4 ii, DNA as follows. Genon ic 
0 NA was digested with aindm and run on a low-.eltlng 
3 0 agarose gel. DNA migrating within the 4-6 kb size 
range was excised, and the low-„elting agarose was 
removed by phenol extraction and precipitation 
(Maniatis et al., Mcleculzr cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory (i 9 83),. TheS e 
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DNA fragments were ligated to ffindlll-digested DNA from 
phage A.NM1149 (Murray, Phage Lambda and Molecular 
Cloning, Cold Spring Harbor Laboratory, 1983, pp. 395- 
432). The product DNA was packaged with packaging 
extract from Promega. This library had a total of 
140,000 plague-forming units (pfu) , of which 70% were 
recombinants, as estimated from the ratio of pfu on 
bacteria C600hfl and C600. 

The phage genomic library (provided by J. Sulston) 
was prepared by partial digestion of wild-type C. 
elegrans genomic DNA with Sau3A and cloning into the 
BamHI site of phage vector A.2001 (Karn et al . , Gene 
32:217-224 (1984)). 

Tc4 Probe 

The Tc4 probe used for cloning the ced-4 gene and 
for Southern blots was Tc4-nl352 , which contains a ,Tc4 
element isolated from an unc-86 (nl351) mutant strain 
(Finney et al., Cell 55:757-769 (1988); Yuan et al., 
Proc. Natl. Acad. Sci. USA 88:3334-3338 (1991)). DNA 
20 was labelled with 32 P using either the nick-translation 
procedure described by Maniatis et al . (1983 supra) or 
the oligo-labelling procedure described by Feinberg and 
Vogelstein (Anal. Biochem, 132:6-13 (1983)). 
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RNA Prepar ation , Nort hern Blot and Pri^Pr Extension 

Total c. elegans RNA was extracted using guanidine 
isothiocyanate (Kim and Horvitz, Genes & Dev. 4:357-371 
(1990)). Poly (A) + RNA was selected from total RNA by a 
poly (dT) -column (Maniatis et al . , 1983 supra). To 
prepare stage-synchronized animals, eggs were obtained 
from gravid C. elegans adults grown at 2 0°c in liquid 
culture. A 5 - 10 ml sample of animals was treated 
with 50 ml of NaOCl/NaOH solution (10 ml NaOCl, 1 g 
NaOH, 40 ml H 2 0) for about 10 minutes with vortexing 
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until the adults were dissolved. Eggs were centrifuged 
and washed three times with M9 buffer. Isolated eggs . 
were allowed to hatch in S basal medium without food 
for 14 hours at 20-C with shaking. LI larvae were 
5 collected by low-speed centrifugation after growth on 
E. coin. HB101 for 2 hours, L2 larvae after 12 hours, L3 
larvae after 24 hours, L4 larvae after 36 hours and 
adults after 48 hours. Northern blot analysis using 
DNA probes was performed essentially as described by 
> Meyer and Casson (Genetics 105:29-44 (1986)), except 
that RNA was transferred from the gel to the Gene 
Screen filter (DuPont, Wilmington, DE) by capillary 
action. 

Quantitation of ced-4 expression during embryonic 
development was done by hybridizing two duplicate 
northern blots with ced-4 cDNA clone SK2-2 and with a 
genomic DNA clone for the actin 1 gene, pW-16-210 
which hybridizes to the 3' untranslated region of 'the 
actm l transcript (Krause and Hirsh, in: Molecular 
Biology of the Cytoskeleton , Borisy et al . (eds.), Cold 
Spring Harbor Laboratory, 1984, pp. 287-292). The two 
probes were of the same specific activity (4 x 10 8 
counts/minute/ M g) . The emission of fl particles from 
the ced-4 and' actin 1 bands was counted using a B 
counter (Betagen, Waltham, MA). The readings were 7 7 " 
counts/minute for the actin 1 band and 1.4 counts/ 
minute for the ced-4 band. 

The primer extension protocol was that of Sambrook 
etal. (Molecular Cloning: A Laboratory Manual , 2nd 
edition, cold Spring Harbor Laboratory, 1989, pp. 7.79- 
7.83), using the primer ATTGGCGATCCTCTCGA (Seq. ID 
#22). to define the lengths of the reaction products, 
a sequencing reaction using this primer and C10D8-5 as 
template was run adjacent to the product of the primer 
extension reaction in the sequencing gel. 
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hybrid 6 • di " Ctl0n ° f transcri P tio " — determined by 
hybr ldlzlng northern Wots with single . stranded 

5 L » ribe plasmid staining ^ . nsert 

Wt " hl *. ClSaVed at «• « the other end of the 
£T« T Uneari " d was transcribed using 

Zo7Z b * P T e " Se - " Spe «^< *-«tin g pna 
10 f r ' ThSSe *»* P^°du=ts were used to 

b y 8 LTT blots according to a — 01 d — - 

«* formal tabr °* ! '"^idized in 

d! 7° 6 S ' °- 5% SDS ' 250 ^ •»!"«« sper* 

L5 in / x Ss ^ en 0 h ^ ridi2ed " ith «* «'C «- -shed 

each end then in 2 x SSC , o.!% SDS once at SO'C for 20 
-nutes. northern Wot experiments showed that the 
single-stranded UNA probe transcribed by T3 rna ! 

0 ^r^r 6 hybridl2ed t0 th * *•* ». ceo-, mRNA, while 
0 the probe made by T7 RNA polymerase did not. ihis 
result indicates that the direction of the 

srns.:* from the bmhi — - ™ 

Determination of nw A Seg iiP.n. 

' ««e llV^t^ "** "*>™: deletions 
were made according to Henikoff ( Ce ne 28:351-359 

U984), DNA sequences were determined using Seguenase 
an protocols obtained from D s Biochemicals 'clev'ela d 

• The ced - 4 DNA seance was confirmed by 
sequencing both strands of cDNA anrt • 

cuma and genomic DNA clones. 



C loning of the f^jJ Fra<m a n<- n ^„ . 

'co*t*T C ° Smi * C10D8 di9e " eCi With *" R1 " 

-coHI fragments of 2.2 *b (r5, and 2.4 *b (r7) , both of 
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which hybridized to a mixture of ced-4 cDNA subclones 
SK2-1 and SK2-2, were isolated. r7, which hybridized 
to SK2-1, which contains the 3' half of ced-4 cDNA 
clone SK2 , was cloned into the EcoKL site of plasmid 
pBSKII (Stratagene) . The EcoRI site at the 3' end of 
r7 was deleted by digesting with Styl, which cut once 
at 0.2 kb from the 3' end of the insert, and Sail, 
which cut once in the polylinker, and then religating. 
The deleted r7 plasmid was linearized with EcoRI and 
ligated with ScoRI-digested r5, which hybridized to 
Sk2-2, the 5' half of ced-4 cDNA SK2. Clones were 
analyzed for the correct orientation of the r5 insert 
based on the cDNA restriction map. One such correctly 
oriented clone was named C10D8-5. 



15 Microinjection * n d Transformation 

The procedure for microinjecting DNA into the-, 
gonad to obtain germline transf ormants was basically 
that of Fire (EMBO J. 5:2673-2680 (1986)) with 
modifications introduced by J. Sulston. Cosmid DNA to 

2 0 be injected was purified twice using CsCl-gradient 

centrifugation (Maniatis et al . , 1983 supra). Plasmid 
DNA to be injected was prepared by alkaline minipreps 
(Maniatis efal., 1983 supra). DNA was treated with 
RNAase A (37 °C, 3 0 minutes) and then with proteinase K 
25 (55°C, 30 minutes), extracted with phenol and then 

chloroform, precipitated twice (first in 0.3 M sodium 
acetate and then in 0.1 M potassium acetate, pH 7.2), 
and resuspended in 5 ul of injection buffer (Fire, 1986 
supra) . DNA concentrations used for injection were 

3 0 0.1-1.0 mg/ml. 

All transformation experiments used a ced-l; 
ced-4 (nll62) ; unc-31 strain as the recipient. The 
expression of the Ced-4 phenotype was quantified by 
counting the number of cell corpses in the heads of 
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voung 11 annals. The cosmid C10D8 or plasmid 
subclones of C10DS were mixed with cosmid C14G10, which 
centals the wild-type unc .„ (+) ^ , t ^ ^ ^ 
2. -1 or 3. -a to increase the likelihood that a 
Phenotypically non-Un= transfcrmant would contain the 
cosmid or plasmid being tested. Generally, 20-30 
animals were injected in one experiment. Non-Unc Fl 
progeny of injected animals were isolated three to four 
days later. About 1/2 to 1/3 of the non-Unc progeny 

could be established as lines of transf ormants . Lung 

non-Unc progeny of such non-Unc transf ormants were 
examined using Nomarski optics to determine the number 
of cell corpses present in the heads. 

15 £ ^-4 rv*ion prote m an* ant ibo^ c ,.. r . r . M _ n 

To express a Ced-4 fusion protein in E. coli, a 

C n!r, COntainln9 b ° th ^ " ^ 3 ' halves «* the ceo--, 
cDNA (SK2-2 and SK2-1) in the expression vector pET- 5 a 
(Rosenberg et al., Gene SSU25-135 (1987)) was 
20 constructed. The fusion protein expressed by this 

vector was expected to include 11 amino acids of phage 
T7 gene 10 protein, 5 amino acids of linker and the 546 
ammo acids encoded by ced-4 cDNA SK2 . The pJ76 
Plasma, which encodes this fusion protein, was 
transformed into bacterial strain BL21. C ed-4 fusion 
protein was produced by this transformed strain as 
expected, and subjected to electrophoresis on a' 
polyacrylamide gel. A band, with mobility equivalent 
to about 64 x i 0 3 Mr, specific to the transformed 
stram was exercised and used to immunize three 
rabbits, sera from all three rabbits tested positive 
on western blots (Towbin et al . , Pr0 c. Acad. 5<ri 

VSA 75:4350-4354 (1979),. These sera were purified 
using immunoblots (Harlow and Lane, Antibodies- A 
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Laboratory Manual, Cold Spring Harbor Laboratory, 
1988) . 



RESULTS 

Cloning of the ced-4 R ene- bv Transposon Tag ging 
5 The ced-4 allele nl416 in the C. elegans strain 

TR679 was isolated, which carries the mutator mut- 
2(r459) and shows an elevated frequency of 
transposition elements (Collins et al., Nature 328:726- 
728 (1987); Yuan et al., Proc. Natl. Acad. Sci. USA 
10 88:3334-3338 (1991)). The ced-4 (nl416) mutation is 
closely linked to a newly transposed copy of the C. 
elegrans transposon Tc4 (Yuan et al., 1991 supra). 
Using Tc4 as a probe, this novel Tc4 element and its 
flanking region was cloned as a 5 kb ffindlll fragment 
15 from a 4-6 kb size-selected ced-4 (nl416) genomic phage 
library. A 3 kb adjacent to this Tc4 element was 
isolated by digesting the 5 kb tfindlll fragment with 
BamHI. This 3 kb fragment, called pnl416, was cloned 
into the Bluescribe M13+ plasmid vector (Stratagene) . 

2 0 when used as a probe on Southern blots, pnl4l6 

hybridized to. a 3.4 kb ffindlll fragment in DNA of wild- 
type (strain N2) and two non-Ced revertants of 
ced-4(nl416) , ced-4 (nl416 nl712) and ced-4 (nl416 nl713) 
(Yuan and Horvitz, Dev. Biol. 138:33-41 (1990)), and a 
25 5 kb HindUI fragment in ced-4 (nl416) animals. The 
hybridizing band in ced-4 (nl416) DNA is 1.6 kb larger 
than that of the wild-type or the revertants, 
indicating that an insertion of this size is present in 
the ced-4 (nl416) strain and is deleted in both 

3 0 revertants. These observations indicate that the Tc4 

insertion in ced-4 (nl416) animals is responsible for 
their Ced-4 mutant phenotype and suggest that pnl4i6 
contains at least part of the ced-4 gene. 
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To isolate additional genomic DNA from the region 
of this Tc4 insertion, pnl4l6 was used to probe a c. 
elegans Bristol N2 genomic DNA phage library. Five 
phage clones with inserts of 10 to 
5 15 kb were isolated and shown to share a 3 kb BajnHI- 
flijidlll fragment that hybridized to pnl416. These 
phage clones were used to identify cosmids that 
hybridized to them and that were members of a 600 kb 
contig of overlapping cosmids (Coulson et al . , Proc. 
10 Natl. Acad. Sci. USA 83:7821-7825 (1986)). By using 
the phage clones as probes to hybridize to Southern 
blots, a cosmid C10D8 was identified as containing all 
regions of genomic DNA present in all five phage clones 
and in pnl.416. 

15 The ced-4 Mutan t Phenotype Can Be Rescued bv a 4.4 kb 
DNA Fragment 

To identify ced-4 (+) DNA capable of complementing 
the Ced-4 mutant phenotype, the cosmid C10D8 was 
injected into the oocytes of ced-4 (nll62) animals. To 

20 facilitate the identification of transgenic animals, a 
mutation in the unc-31 gene, which affects locomotion, 
was included as a marker for co-transformation (Kim and 
Horvitz, Genes & Dev. 4:357-371 (1990)). Cosmid 
C14G10, which contains the wild-type allele of unc-31 

25 and does not have Ced-4-rescuing activity was 
coinjected with cosmid C10D8 into ced-l (el735) ; 
unc-31 (e928) ; ced-4 (nll62) animals. The ced-l mutation 
was included to facilitate the scoring of the ced-4 
mutant phenotype (Ellis and Horvitz, Cell 44:817-829 

30 (1986)). Specifically, when a cell undergoes 

programmed cell death in C. elegrans, its corpse is 
quickly engulfed and destroyed by a neighboring cell 
(Robertson and Thomson, J. Embryo 1 . Exp. Morph. 67:89- 
100 (1982); Sulston et al . , Dev. Biol. 100:64-119 
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(1983)). A ced-2 mutation prevents this engulfment, 
allowing the cell corpse to remain intact (Hedgecock et 
al., Science 220:1277-1280 (1983)). Thus, in a first 
or second stage (LI or L2) ced-1 mutant larva, many 
5 cell corpses are present and can be easily visualized 
using Normaski optics, ced-4 mutations prevent cell 
death and the appearance of these corpses. Thus, 
suppression of the Ced-4 mutant phenotype by a wild- 
type ced-4 gene can be observed and readily quantified 

10 in a ced-1 mutant background based on an increase in 
the number of visible cell corpses. 

From one such microinjection experiment, three 
non-Unc animals rescued for the Unc-31 mutant phenotype 
were picked from among the Fl progeny, and from one of 

15 them a line of non-Unc transf ormants was obtained. No 
true-breeding non-Unc animals could be isolated from 
this line: about 25% of the progeny of all non-Unc, 
animals were, Unc. Since no inviable zygotes were 
observed among the progeny of these non-Unc animals, 

2 0 this transf ormant did not carry a recessive lethal 

insertion mutation. Rather, it seems likely that the 
injected DNA was maintained as an extrachromosomal 
array that was segregated to only some gametes, as has 
been reported previously for many other C. elegans 
25 transgenic strains (e.g., Stinchcomb et al., Mol. Cell 
Biol. 82:110-156 (1985); Fire, EMBO J. 5:2673-2680 
(1986); Way and Chalfie, Cell 54:5-16 (1988)). This 
putative extrachromosomal array was named nExl. Young 
LI progeny of n^l-containing animals were examined 

3 0 using Nomarski optics for the Ced-4 phenotype. 

Young LI ced-1 animals have an average of 23 cell 
corpses in the head, while ced-1 (el735) ; ced-4 (nll62) 
animals have an average of 0.6 cell corpses (Ellis and 
Horvitz, Cell 44:817-829 (1986)). Young LI ced-1; 
35 ced-4 (nll62) ; nExl animals had an average of nine cell 
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corpses in the head. These results indicate that 
cosmid C10D8 restored significant, but not total, 
ced-4 (+; activity in the transf ormants . 

To delineate the ced-4 gene within C10D8, various . 
5 subclones of C10D8 were injected into ced-4 mutant 
animals and tested for their ability to rescue the 
Ced-4 mutant phenotype (Table 1) . The smallest 
subclone plasmid that could rescue the ced-4 phenotype 
as effectively as cosmid C10D8 was a 4.4 kb fragment, 
10 called C10D8-5. C10D8-5 and the unc-31 (+) -containing 
cosmid C14G10 were coinjected into ced-l; unc-31; 
ced-4 (1x1162) animals. Two lines' of non-Unc 
transformants were isolated. Since these animals 
continued to segregate Unc animals and did not produce 
15 inviable zygotes, both appeared to carry 

extrachromosomal arrays, which were designated nEx7 and 
nExa . Young LI animals from these transf ormant strains 
had an average of 11.5 cell corpses in their heads, 
indicating that plasmid C10D8-5 restored ced-4 (+) 
20 activity as well as did cosmid C10D8 (Table 1). 

Identification of a ced-4 Transcript 

Restriction sites of plasmid C10D8-5 .(which can 
rescue the Ced-4 phenotype) and pnl416 (which contains 
sequences adjacent to the Tc4 insertion site) were 
2 5 mapped. C10D805 was found to overlap with 2 kb of 
sequence in pnl416, including the Tc4 insertion site 
(Figure 8) . 

In Northern blot experiments, both pnl416 and 
C10D8-5 were used to probe poly (A) + RNA populations of 
30 mixed developmental stages of wild-type (strain N2), 
ced-4 (nl416) , and ced-4 (n!416 nl712) and ced-4 (nl416 
nl713) revertant animals. pnl416 hybridized to a 2.2 
kb transcript and an 0.9 kb transcript in RNA from N2 
animals, and a 3 kb transcript, a transcript slightly 
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larger than the wild-type 2.2 kb transcript, and a 
transcript slightly smaller than the wild-type 0.9 kb 
transcript in ced-4 (nl416) animals. The 3.8 kb RNA 
contained Tc4 sequence (see below) , suggesting that 
5 this RNA resulted from the insertion of the 1.6 kb Tc4 
sequence into the ced-4 sequence encoding 2.2 kb 
transcript. The transcript slightly larger than the 
2.2 kb wild-type transcript did not contain Tc4 
sequence. This ced-4 (n!416) RNA might have been an 
10 aberrant transcript containing sequences adjacent to 
the ced-4 gene: when pnl416 was used as a probe, the 
wild-type 2.2 kb and the slightly larger transcript in 
this mutant were relatively similar in intensities, 
whereas when ced-4 cDNA clone SK2-1 was used as a 
15 probe, this mutant transcript was not detected (see 
below) . These observations indicate that the 
ced-4 (nl416) 2.2 kb transcript contains sequences from 
the ced-4 region but does not contain sequences 
corresponding to at least the 3' half of the ced-4 
20 mRNA. The two revertants of ced-4 (nl416) , ced-4 (nl416 
nl712) and ced-4(nl416 nl713) , contained both 2.2 kb 
and 0.9 kb transcripts with similar sizes to the wild- 
type transcripts. Thus, both the 2.2 kb and the 0.9 kb 
transcripts were altered in ced-4 (nl4 16) animals, and 

2 5 both were restored in the two non-Ced revertants. 

To determine if any of the transcripts contains 
Tc4 sequence, the Northern blots were probed with 
Td-nl351, which contains the 1.6 kb Tc4 element 
present in the Tc4 -induced mutant unc-86 (nl35i) as well 

3 0 as 4 kb of unc-8 6 sequences. Tc4 -n!351 hybridized both 

to a 3.8 kb transcript of the Tc4-induced mutant 
ced-4 (nl.416) and to a 1.5 kb unc-68 transcript in both 
ced-4 (n!416) and N2 animals. 

To determine whether one or both of the 2.2 kb and 
3 5 0.9 kb. transcripts are encoded by ced-4, subclone 
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C10D8-5, which rescued the Ced-4 phenotype, was used to 
probe the Northern blots. C10D8-5 detected the wild- : 
type 2.2 kb transcript, the ced-4 (nl4l6) transcript 
slightly larger than the 2.2 kb transcript, and the 
5 ced-4(n!416) 3.8 kb transcript. C10D8-5 did not 
hybridize to the 0.9 kb transcript, indicating that 
this transcript is unlikely to be encoded by ced-4. 
C10D8-5 also detected a 1.4 kb transcript, which was 
not altered by the Tc4 insertion in ced-4 (nl416) 
10 animals, only a 470 bp ZcoRI-stuI fragment at one end 
of C10D8-5 hybridized to this 1.4 kb RNA. since 
C10D8-5 did not contain the complete coding region for 
this RNA, and since this RNA was unaffected in 
ced-4 (U1416) animals, this 1.4 kb RNA seems unlikely to 
15 be a ced-4 transcript. The relationships among cosmid 
C10D8-5, pnl4l6 and the 0.9 kb, 1.4 kb and 2.2 kb 
transcripts are summarized in Figure 8. 

On Northern blots probed with the ced-4 cDNA clone 
SK2-1, the level of the 2.2 kb transcript showed 
20 significant reduction in all three independently 

derived EMS-induced ced-4 mutants examined, strongly "~ 
supporting the hypothesis that this 2.2 kb transcript 
is a ced-4 transcript. Total RNA from N2, 
ced-4 (nll62) ,■ ced-4 (nl416) , ced-4 (nl894) and 
25 ced-4 (nl920) eggs was probed with 32 P-labelled ced-4 
cDNA SK2-1. An actin 1 probe (Krause and Hirsh, in: 
Molecular Biology of the Cytoskeleton , Borisy et al . 
(eds.), Cold Spring Harbor Laboratory, 1984, pp. 287- 
292) was used as an internal control for the amount of 
RNA loaded in each lane. The ratios of the intensity 
of the ced-4 band to that of actin band in N2 f nil 62, 
nl416 and nl894 were 0.5, 0.17, 0 and 0.12, 
respectively, a Northern blot of poly (A) + RNA from 
stage-synchronized animals was probed with pnl4i6 
which hybridizes both to the 2.2 kb ced-4 transcript 
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and to a 0.9 kb transcript. The 0.9 kb transcript 
seems to be expressed mostly in eggs and adults. The 
presence of RNA in all lanes was confirmed by loading 
1/10 of each sample on another gel and probing a 
5 Northern blot from this gel using the C. elegans actin 
1 gene (Krause and Hirsh, 1984 supra) . That all of 
these distinct ced-4 mutations cause reduced levels of 
a ced-4 transcript could reflect either instability of 
all three mutant transcripts or a role for ced-4 in 
10 regulating its own expression. 

Based upon these results, it can be concluded that 
the 2.2 kb RNA is a ced-4 transcript. It is not known 
why the 0.9 kb RNA. is also altered in ced-4 (nl416) 
animals. Perhaps transcription of the 0.9 kb RNA is 
initiated incorrectly as a consequence of the nearby 
Tc4 element. 



15 



ced-4 Expressio n is Primarily Embryonic 

A Northern blot containing RNAs from stage- 
synchronized animals of different developmental stages 

20 probed with pnl416 showed that the 2.2 kb ced-4 
transcript was expressed primarily during 
embryogenesis. This result is consistent with the 
observation that 113 of the 131 programmed cell deaths 
in the c. elegans hermaphrodite are embryonic (Sulston 

25 and Horvitz, Dev. Biol. 82:110-156 (1977); Sulston et 
al., Dev. Biol. 100:64-119 (1983)). The 2.2 kb RNA was 
relatively abundant during embryonic development. The 
0.9 kb transcript was expressed mostly in eggs and 
adults. The presence of RNA in all lanes was confirmed 

3 0 by loading 1/10 of each sample on another gel and 
probing a Northern blot from this gel with the C. 
elegans actin 1 gene (Krause and Hirsh, 1984 supra) . 
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The ced -4 Transcript is Present in a ced-3 Mutant 

The activities of both ced-3 and ced-4 are 
required for programmed cell death (Ellis and Horvitz, 
Cell 44:817-819 (1986)). One possibility is that one 
5 of these genes positively regulates the expression of 
the other. For this reason, a Northern blot of wild- 
type strain N2 and ced-3 (n717) poly (A) + RNA was probed 
with pn!416. This experiment showed that the 2.2 kb 
ced-4 transcript was present at an apparently normal 
10 level in this ced-3 mutant. Thus, the activity of the 
ced-3 gene is unlikely to be necessary for the 
expression of the ced-4 2.2 kb transcript. 

Identification of ced-4 cDNA Clones 

To isolate cDNA clones of ced-4, pnl416 was used 

IS to probe a C. elegans cDNA phage library made from 
wild-type strain N2 mixed-stage RNA (Kim and Horvitz, 
Genes & Dev. 4:357-371 (1990)). Two cDNA clones were 
isolated. The two cDNA clones (named SKI and SK2) 
hybridized to the 2.2 kb ced-4 transcript. Both are 

20 about 1.8 kb in size, and both contain one 0.8 kb and 
one 1.0 kb EcoRI fragment. These -EcoRI fragments were 
subcloned into plasmid vector Bluescribe M13+ 
(Stratagene) . The two subclones derived from SKI were 
named SK1-1 and SK1-2, and the two subclones derived 

25 from SK2 were named SK2-1 and SK2-2. The restriction 
maps of the SKI- and SK2 -derived clones were the same. 
Sequence analysis of the ends of the four cDNA 
subclones confirmed the equivalence of the SKI and SK2 
clones, except that SK1-2 contains a poly (A) sequence 

3 0 of more than 50 bp at its 5 ' end. This poly (A) 

sequence is probably a cDNA cloning artifact, since 
SK1-2 contains the 5' half of the cDNA (see below). 





The ced -4 Sequence 

The DNA sequence of the SK2 1.8 kb cDNA clone was . 
determined. This sequence includes an open reading 
frame encoding 546 amino acids (Figure 1; Seq. ID #2), 
5 which is consistent with the results of Northern blot 
analysis using single-stranded RNA probes. An ochre 
termination codon (TAA) is located in-frame near the 3' 
end, indicating that the 3' end of the 2.2 kb 
transcript is most likely included in this cONA. The 

10 open reading frame extends to the 5' end of the 1.8 kb 
cDNA, suggesting that this cDNA might lack the 5' end 
of the ced-4 coding region. 

A primer extension experiment was performed to 
determine the ced-4 transcription initiation site(s) 

15 using the primer ATTGGCGATCCTCTCGA (Seq. ID #23) and 
C10D8-5 as template. A major transcriptional 
initiation site was identified at 54 bp before (5' of) 
the beginning of the ced-4 cDNA SK2 and a minor 
initiation site at 54 bp after (3' of) the beginning of 

20 this cDNA (Figure 1) . The first AUG.- codon after the 
presumptive major start site is located at 9 bp before 
the beginning of the cDNA (Figure 1) . If this site is 
used to initiate protein synthesis, the Ced-4 protein 
would be 549 -"amino acids in length. The first AUG 

25 codon after the presumptive minor start site is located • 
at 13 0 bp after the beginning of the cDNA. If this 
site is used, the Ced-4 protein would be 503 amino 
acids in length. Preliminary results using an anti- 
Ced-4 antibody raised against a Ced-4 fusion protein 

3 0 showed that endogenous Ced-4 protein is slightly 

smaller in molecular weight than a Ced-4 fusion protein 
of 562 amino acids expressed in E. coli. Thus, most 
Ced-4 ptotein is probably initiated near the start of 
the cDNA and is presumably 549 amino acids in length 

3 5 and 62,977 in relative molecular mass. The direction 
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of the open reading frame is consistent with the 
direction of transcription, as demonstrated by probing 
Northern blots with single-stranded RNA probes. The 
presumptive Ced-4 protein is highly hydrophilic, with a. 
5 pi of 5.12. The longest hydrophobic region is a 
segment of 12 amino acids from residues 382 to 393. 

A Western blot of wild-type strain N2 mixed-stage, 
ced-4 (nl416) mixed-stage, wild-type egg, and 
bacterially expressed protein (pJ76) was probed using 
10 anti-Ced-4 antibody. Ced-4 fusion protein (pJ76) was 
made by cloning ced-4 cDNA SK2 into the T7 expression 
vector pET-5a (Rosenberg et al./Gene 55:125-135 
(1987)), so that 546 amino acids of Ced-4 sequence were 
fused to 11 amino acids of T7 gene 10 protein and 5 
15 amino acids of linker sequence. This Ced-4 fusion 
protein is similar in relative molecular mass to the 
endogenous Ced-4 protein, which is present in wild-type 
(N2) but missing in ced-4 (n!41 6) animals . The proteins 
phosphorylase b, 97 x 10 3 ; bovine serum albumin, 66 x 
20 10 3 (Hirayama et al., Biochem. Biophys, Res. Comm. 

173:639-646 (1990)); and ovalbumin, 43 x 10 3 , were used 
as molecular weight standards. 

To confirm the DNA sequence obtained from the 
ced-4 cDNAs and to study the structure of the ced-4 
25 gene, the sequences of the 4.4 kb cosmid subclone 

C10D8-5, the 3 kb insert pnl416, and the 2 kb ITindlll- 
BamHI fragment that contains the Tc4 insertion in the 
ced-4 (nl416) mutant were determined. Comparison of the 
ced-4 genomic and cDNA sequences revealed that the 
3 0 ced-4 gene has seven introns of sizes ranging from 44 
bp to 557 bp (Figure 2) . The exon sequences of genomic 
clone C10D8-5 are identical to the sequences of ced-4 
cDNA Comparison of the Tc4 insertion site in 

ced-4 (n!416) DNA with the ced-4 (+) genomic and cDNA 
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sequences indicated that Tc4 inserted into an exon in 
the ced-4 gene in ced-4 (nl416) animals (Figure 2). 

The DNA sequences of eight EMS-induced ced-4 
alleles were also determined (Table 2) . one of the 
eight, nl94B , is a aissense mutation. Of the seven 
others, four create stop codons and three are predicted 
to affect splicing of the ced-4 transcript. The 
positions of these mutations are indicated in Figure 2. 
These findings indicate that the phenotypes of these 
mutants (Ellis and Horvitz, Cell 44:817-829 (1986)) 
result from a complete loss of ced-4 gene function. 
These mutations establish the null phenotype of the 
ced-4 gene, confirming that ced-4 function is not 
essential for viability. 

15 The Ced-4 Protein Has Two ^n, e c imi1aT . + 0 v „ n ^ 
Calcium -Binding Domains 

By direct inspection, the sequence of the putitive 
Ced-4 protein was compared with the consensus sequence 
of the calcium-binding loop of the EF-hand domain 
(Tufty and Kretsinger, Science 187:161-171 (1975); 
Kretsinger, Cold Spring Harbor Symp. Quant. Biol. 
52:499-510 (1987); Szebenyi and Moffat, J". Biol . Chem. 
2S:8761-8777''(1986)). Two regions of the Ced-4 protein 
were identified that might bind calcium (Figure 3). 

The EF-hand is a 29 amino acid domain consisting 
of a helix-loop-helix region, with the loop portion 
(residues 10-21) coordinating calcium-binding via the 
side-chain oxygens of serine, threonine, asparagine, 
aspartic acid, glutamine or glutamic acid. These 
residues occur at five of the vertices of an 
octahedron: X (position 10), Y (12), Z (14), -x (18), 
-Z (21) . EF-hand amino acid sequences vary 
considerably in the residues present in the calcium- 
binding loop (Figure 3), and some EF-hand domains have 
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only one helical region (Xretsinger, 1987 supra). The 
consensus sequence is shown at the top of Figure 3. 
Positions Y, Z, and -X can have any of a number of 
amino acids which have oxygen-containing side chains. 
Position X is usually aspartic acid, and position -Z is 
usually glutamic acid. 

The sequences of parvalbumins from carp muscle 
(Seq. ID #3; Nockolds et al., Proc. Natl. Acad. Sci. 
USA 65:581-584 (1972)), the intestinal calcium-binding 
protein (iCaBP) (Seq. id #7-8; Szebenyi et al., Nature 
254:327-332 (1981)), troponin C (Seq. . ID #9-12; Collins 
et al., FEBS Lett. 36:268-272 (1973)) and calmodulin 
(Seq. ID #13; Zimmer et al . , J. Biol. Chem. 263:19,370- 
19,383 (1988); Babu et al., Nature 315:37-40 (1985)) 
15 show canonical EF-hands. The hake and ray parvalbumins 
(Seq. ID #4-5; Capony et al . Eur. J. Biochem. 32:97-108 
(1973)); Thatcher and Pechere, Eur. J. Biochem. 75:121- 
132 (1977)), sarcoplasmic calcium-binding protein • 
(SCBP) from 'the protochordate Amphioxus (Seq. id #6; 
20 TaJcagi et al., Biochemistry 25: 3585-3592 (1986)), 
trypsinogen (Seq. ID #14; Bode and Schwager, J. Mol. 
Biol. 58:693-717 (1975)), fibrinogen (Seq. ID #15; 
Doolittle, Ann. Rev. Biochem. 53:195-229 (1984); Dang 
et al., J. Biol. Chem. 260:9713-9719 (1985)), villin 
25 (Seq.. ID #16; Hesterberg and Weber, J. Biol. chem. 

258:365-369 (1983)) and galactose-binding protein (GBP) 
(Seq. ID #17; Vyas et al., Nature 327:635-638 (1987)) 
show variations from the consensus sequence. GBP does 
not contain the helices of the EF-hand. 
3 0 The potential calcium-binding loops of sequence 1 

and sequence 2 are located at amino acids 77-88 and 
amino acids 292-303 of the Ced-4 protein, respectively 
(Figure 3) . In its putative calcium-binding loop, the 
first- potential EF-hand-liJce sequence of the Ced-4 
35 protein has four (positions Y, Z, -X, -Z) of the five 
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conserved residues with oxygen-containing side chains 
(shown in bold) , and the fifth position (X) has a 
tyrosine rather than an aspartic acid; tyrosine 
contains oxygen in its side chain. The second 
5 potential EF-hand-like sequence of the Ced-4 protein 
has three residues (positions Z, - X/ -2) that match the 
consensus sequence, and amino acids with oxygen- 
containing side chains at the other two positions. 
These observations suggest that these two regions of 
the Ced-4 protein might bind calcium. Like the Ced-4 
protein, a number of known calcium-binding proteins, 
such a bovine intestinal calcium-binding protein 
(ICaBP) (Szebenyi and Moffat, 1986 svpra) , rabbit 
troponin c (Collins et aJ . , 1973 supra), trypsinogen 
and villin (Doolittle, 1984 supra; Danget et al., 1985 
supra) have only three or four conserved residues at 
these five positions (Figure 3). The EF-hand domains 
in ICaBP and troponin C have been shown by x-ray ' 
crystallography to bind calcium. 

One. major difference between the Ced-4 protein and 
the calcium-binding loop of the EF-hand consensus 
sequence is at position 15. Here, the two Ced-4 
sequences have a histidine and a glutamic acid, 
respectively- whereas most ET-hand-containing proteins 
have a glycine; this glycine has been suggested to be 
important for the turning of the loop (Kretsinger, 1987 
supra). However, a histidine is present at this 
position in a parvalbumin and an aspartic acid is 
present in another parvalbumin and also in a 
sarcoplasmic calcium-binding protein (Kretsinger, 1987 
supra) (Figure 3). Thus, the presence of histidine or 
glutamic acid at position 15 does not rule out the 
possibility that these regions bind calcium. 

The calcium-binding loop (positions 10-21) of the 
EF-hand is thought to be preceded (positions 1-9) and 
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followed by alpha-helical domains (positions 22-29) 
(Kretsinger, 1987 supra) . Since position 3 of Ced-4 
sequence l and positions 26 and 28 of Ced-4 sequence 2 
are prolines, these regions might not form alpha- 
helices. However, the known calcium-binding protein 
galactose-binding protein (GBP) has a calcium-binding 
domain similar to that of the EF-hand (Figure 3) but 
without the two helices; furthermore, position 29 of 
GBP is proline (Vyas et al . , 1987 supra). Thus, the 
Ced-4 protein need not contain such alpha-helical 
calcium-binding domains. 

Based upon these considerations, it seems likely 
that the ced-4 protein binds calcium or a similar 
divalent cation. 



EXAMPLE 2 

.CLONING , SEQUENCING. AND CHap&r T ERTZATTntt nr 

THE CED-3 GFWT; 

MATERIALS AND METHODS 

General Methods « n d Strains 

The techniques used for the culturing of c. 
elegans were as described by Brenner (Genetics 77:71-94 
(1974)). All strains were grown at 20 °c. The wild- 
type parent strains were C. elegans variety Bristol 
strain N2, Bergerac strain EM1002 (Emmons et al . , Cell 
32:55-65 (1983)), c. briggsae and C. vulgaris (obtained 
from V. Ambros) . The genetic markers used are 
described below. These markers have been described by 
Brenner (1974 supra), and Hodgkin et al. (in: The 
Nematode Caenorhabditis elegans, Wood and the Community 
of C. elegans Researchers (eds.), Cold Spring Harbor 
Laboratory, 1988, pp 491-584). Genetic nomenclature 
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follows the standard system (Horvitz et al., Hoi . Gen. 
Genet. 275:129-133 (1979)). 

LG I: ced-l(el375) ; unc-54(r323) 
LG VI: unc-31(e928), unc-30 (el91) , ced-3(n717, n71B, 
nl040, nll29, nll63, nll64, nll65, nl286, 
nl949, n2426, n2430, n2433) , unc-26 (e20S) , 
dpy-4 (ell66) 
LG V: egl-i(n986) ; unc-76(e911) 
LG X: dpy-3(e27) 

10 Isolation of Addit ional Alleles of ced-i 

A non-complementation screen was designed to 
isolate new alleles of ced-3. Because animals 
heterozygous for ced-3 (n717) in trans to a deficiency 
are viable (Ellis and Horvitz, Cell 44:817-829 (1986)), 
animals carrying a complete loss-of-f unction ced-3 
allele generated by mutagenesis were expected to be 
viable in trans to ced-3 (n717) t even if the new allele 
was inviable in homozygotes. Fourteen EMS mutagenized 
egrl-l males were mated with ced-3 (n7 17) unc-26 (e205) ; 
20 egl-l(n487); dpy-3(e27) hermaphrodites, egl-l was used 
as a marker in this screen. Dominant mutations in 
egrl-i cause the two hermaphrodite specific neurons, the 
HSNs, to undergo programmed cell death (Trent et al., 
Genetics 104:619-647 (1983)). The HSNs are required' 
for normal egg-laying, and egl-i(n986) hermaphrodites, 
which lack HSNs, are egg-laying defective (Trent et 
a2., 1983 supra). The mutant phenotype of egl-l is 
suppressed in a ced-3; egl-l strain because mutations 
in ced-3 block programmed cell deaths, egl-l males 
were mutagenized with EMS and crossed with ced-3 (n717), 
unc-26 (e205); egl-l (n487) ; dpy-3 (e27) . Most cross 
progeny were egg-laying defective because they were 
heterozygous for ced-3 and homozygous for egl-l. -Rare 
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egg-laying competent animals were picked as candidates 
for carrying new alleles of ced-j. Four such animals 
were isolated from about l 0/ 000 Fi cross progeny of 
EMS-mutagenized animals. These new mutations were made 
homozygous to confirm that they carried recessive 
mutations of ced-3. 

Molecular Fj»T ?Tr 

Standard techniques of molecular biology were used 
(Maniatis it ml.. Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, 1983). 

Two cosmid libraries were used extensively in this 
work: a Sau3AI partial digest genomic library of 7000 
clones in the vector P HC79 and a Sau3AI partial digest 
genomic library of 6000 clones in the vector pJB8 (ish- 
15 Horowicz and Burke, Nucleic Acids Res. s ;2 989 (1981)). 

The "right" end of MMM-Cl was cloned by cutting it 
with Hindlll and self-ligating. The "left" end of- 
MMM-Cl was cloned by cutting it with Bgliz or Sail and 
self-ligating. 

The "right" end of Jc8 was made by digesting Jc8 
with EcoRI and self-ligating. The "left" end of Jc8 
was made by digesting Jc8 by Sail and self-ligating. 

C. elegans RNA was extracted using guanidine 
isothiocyanate (Kim and Horvitz, Genes & Dev. 4-357-371 
25 (1990)). Poly(A) + RNA was selected from total RNA by a 
poly(dT) column (Maniatis et al., i 983 supra). To 
prepare stage-synchronized animals, worms were 
synchronized at different developmental stages (Meyer 
and Casson, Genetics 106:29-44 (1986)). 

For DNA sequencing, serial deletions were made 
according to a procedure developed by Henikoff (Gene 
28:351-359 (1984)). DNA sequences were determined 
using- Sequenase and protocols obtained from us 
Biochemicals with minor modifications. 
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The Tel DNA probe for Southern blots was pCe2001, 
which contains a Bergerac Tel element (Emmons et al., 
Cell 32:55-65 (1983)), Enzymes were purchased from New 
England Biolabs, and radioactive nucleotides were from . 
5 Amer sham . 

Primer extension procedures followed the protocol 
by Robert E. Kingston (In: Current Protocols in 
Molecular Biology, Ausubel et al . (eds.), Greene 
Publishing Associates and Wiley-Interscience, New York, 
10 p. 4.8.1) with minor modifications. 

Polymerase chain reaction (PGR) was carried out 
using standard protocols supplied by the GeneAmp Kit 
(Perkin Elmer) . The primers used for primer extension 
and PGR are as follows: 

15 Pex2: 5 7 TCATCGACTTTTAGATGACTAGAGAACATC 3' 

(Seq. ID #24) ; , 
Pexl: 5' GTTGCACTGCTTTCACGATCTCCCGTCTCT 3' 

(Seq. ID #25) ; 
SL1: 5V GTTTAATTACCCAAGTTTGAG 3' (Seq. ID #26); 

2 0 SL2 : 5 ' GGTTTTAACCAGTTACTCAAG 3 ' ( Seq . ID #27 ) ; 

Log5: 5' CCGGTGACATTGGACACTC 3' (Seq. ID #28); and 

OligolO: 5' ACTATTCAACACTTG 3' (Seq. ID #29). 

Germline Transformation 

The procedure for microinjection basically follows 

25 that of A. Fire {EMBO J". 5:2673-2680 (1986)) with 

modifications: Cosmid DNA was twice purified by CsCl- 
gradient. Miniprep DNA was used when deleted cosmids 
were injected. To prepare miniprep DNA, DNA from 1.5 
ml overnight bacterial culture in superbroth (12 g 

30 Bacto-tryptone, 24 g yeast extract, 8 ml 50% glycerol, 
900 ml H 2 0, autoclaved; after autoclaving, 100 ml 0.17 
M KH 2 P0 4 and 0.72 M KH 2 P0 4 were added) was extracted by 
alkaline lysis method as described in Maniatis et al. 
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(1983 supra). DNA was treated with RNase A (37° , 30 
minutes) and then with protease K (55° , 30 minutes), 
extracted with phenol and then chloroform, precipitated 
twice (first in 0.3 M sodium acetate and second in 0.1 . 
5 M potassium acetate, pH 7.2), and resuspended in 5 pi 
injection buffer as described by A. Fire (1986 supra) . 
The DNA concentration for injection is in the range of 
100 ug to 1 mg per ml. 

All transformation experiments used ced-l(el735) ; 
10 unc-31 (e928) ced-3 (n717) strain, unc-31 was used as a 
marker for co-transformation (Kim and Horvitz, 1990 
supra) . ced-l was present to facilitate scoring of the 
ced-3 phenotype. The mutations in ced-I block the 
engulf ment process of cell death, which makes the 
15 corpses of the dead ceils persist much longer than in 
wild-type animals (Hedgecock et al. , Science 220:1277- 
1280 (1983)). The ced-3 phenotype was scored as the 
number of dead cells present in the head of young LI 
animals. The cosmid C10D8 or the plasmid subclones of 
20 C10D8 were mixed with C14G10 (unc-31 (+) -containing) at 
a ratio of 2:1 or 3:1 to increase the chances that a 
Unc-31 (+) "transformant would contain the cosmid or 
plasmid being tested as well. Usually, 20-30 animals 
were injected in one experiment. Non-Unc Fl progeny of 
25 the injected animal were isolated three to four days 
later. About 1/2 to 1/3 of the non-Unc progeny 
transmitted the non-Unc phenotype to F2 progeny and 
established a transformant line. The young LI progeny 
of such non-Unc transformant were checked for the 
3 0 number of dead cells present in the head using Nomarski 
optics. 
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RESULTS 



Isolation of Aririj+ jonal ced-3 Allele 

All of the ced-3 alleles that existed previously 
were isolated in screens designed to detect viable 
5 mutants displaying the Ced phenotype (Ellis and 

Horvitz, cell 44:817-829 (1986)). Such screens may 
have systematically missed any class of ced-3 mutations 
that is inviable as homozygotes. For this reason, a 
scheme was designed that could isolate recessive lethal 
10 alleles of ced-3. Four new alleles of ced-3 (nll63, 
M164, nll6S, nl286) were isolated in this way. since 
new alleles were isolated at a frequency of about 1 in 
2500, close to the frequency expected for the 
generation of null mutations by EMS in an average c. 
15 elegans gene (Brenner, Genetics 77:71-94 (1974); 

Greenwald and Horvitz, Genetics 95:147-160 (1980)), and 
all four alleles are homozygous viable, it was ■ 
concluded that the null allele of ced-3 is viable. 



20 



25 



30 



Mappin g RFLPs near tre>r!-i 

Tel is a c. elegans transposable element that is 
thought to be immobile in the common laboratory Bristol 
strain and in the Bergerac strain (Emmons et al . , Cell 
32:55-65 (1983)). In the Bristol strain, there are 30 
copies of tci, while in the Bergerac strain, there are 
more than 400 copies of Tel (Emmons et al., i 98 3 supra; 
Finney, Ph. D. thesis, Massachusetts institute of 
Technology, Cambridge, Massachusetts, 1987). Because 
the size of the C. elegans genome is small (haploid 
genome size 8 x lo 7 bp) (Sulston and Brenner, Genetics 
77:95-104 (1976)), a polymorphism due to Tel between 
the Bristol and Bergerac strains would be expected to 
occur about once every 200 kb. Restriction fragment 
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10 



15 



length polymorphisms (RTLPs) can be used as genetic 
markers and mapped in a manner identical to 
conventional mutant phenotypes. A general scheme has 
been designed to map Tel elements that are dimorphic 
between the Bristol and Bergerac strains near any gene 
of interest (Ruvkun et al., Genetics, 122:501-516 
(1989)). once tight linkage of a particular Tel to a 
gene of interest has been established, that Tel can be 
cloned and used to initiate chromosome walking. 

A 5.1 kb Bristol-specific Tel EcoRI fragment was 
tentatively identified as containing the Tel closest to 
ced-3. This Tel fragment was cloned using cosmids from 
a set of Tcl-containing c. elegans Bristol genomic DNA 
fragments. DNA was prepared from 46 such TC1- 
containing cosmids, and this DNA was screened using 
Southern blots to identify the cosmids that contain a 
5.1 kb EcoRI Tcl-containing fragment. Two such cosmids 
were identified: MMM-C1 and MMM-C9. The 5.1 kb EcoRI 
fragment was subcloned from MMM-C1 into pUC13 
20 (Promega). since both ends of Tel contain an EcoRV 
site (Rosenzweig et ai., Nucleic Acids Res. 22:4201- 
4209 (1983)), EcoRV was used to remove Tel from the 5.1 
kb EcoRI fragment, generating a plasmid that contains 
only the unique flanking region of this Tcl-containing 
fragment. This plasmid was then used to map the 
specific Tel without the interference of other Tel 
elements . 

unc-30(el91) ced-3(n727j dpy-4 (ell66) /+++ males 
were crossed with Bergerac (EM1002) hermaphrodites, and 
Unc non-Dpy or Dpy non-Unc recombinants were picked 
from among the F2 progeny. The recombinants were 
allowed to self -fertilize, and strains that were 
homozygous for either unc-30 (el91) dpy-4 (Bergerac) or 
unc-30 {Bergerac) dpy-4 (ell66) were isolated. After 
identifying the ced genotypes of these recombinant 
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strains, DNA was prepared from these strains. A 
Southern blot of DNA from these recombinants was probed 
with the flanking sequence of the 5.1 kb EcoRI Tel 
fragment. This probe detects a 5.1 kb fragment in 
5 Bristol N2 and a 3.4 kb fragment in Bergerac. Five out' 
of five unc-30 ced-3 dpy (+Berg) recombinants, and one 
of one unc-30(+Berg) ced . 3 dpy _ 4 recombinants showed 
the Bristol pattern. Nine of ten unc-30 (+Berg) dpy-4 
recombinants showed the Bergerac pattern. Only one 
10 recombinant of unc-30 (+Bera) dpy-4 resulted from a 
cross-over between ced-3 and the 5.1 kb Tel element. 
The genetic distance between ced-3 and dpy-4 is 2 map 
units (mu). Thus, this Tel element is located 0.1 mu 
on the right side of ced-3. 
15 Cosmids MMM-C1 and MMM-C9 were used to test 

whether any previously mapped genomic DNA cosmids 
overlapped with these two cosmids. A contig of 
overlapping cosmids was identified that extended the 
cloned region near ced-3 in one direction. 
20 To orient MMM-ci with respect to this contig, both 

ends of MMM-CI were subcloned and these subclones were 
used to probe the nearest neighboring cosmid C48D1. 
The "right" end of MMM-CI does not hybridize to C4SD1 
while the "left" end does. Therefore, the "right" end 
of MMM-CI extends further away from the contig. To 
extend this contig, the "right" end of MMM-d was used 
to probe the filters of two cosmid libraries (Coulson 
et al., Proc. Natl. Acad. Sci. DSA 83:7821-7825 
(1986)). one clone, Jc8, was found to extend MMM-CI in 
3 0 the opposite direction of the contig. 

RFLPs between the Bergerac and Bristol strains 
were used to orient the contig with respect to the 
genetic map. Bristol (N2) and Bergerac (EM1002) DNA 
was digested with various restriction enzymes and 
35 probed with different cosmids to look for RFLPs . Once 
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such an RFLP was found, DNA from recombinants of the 
Bristol and Bergerac strains between ced-3 and unc-26 
and between unc-30 and ced-3 was used to determine the 
position of the RFLP with respect to ced-3 
5 The ..right" end of Jc8, which represents one end 

of the contig, detects an RFLP (nP33) when N2 and 
EM1002 DNA was digested with jriadlii. A Southern blot 
of DNA from recombinants between three ced-3 ( + Bera) 
unc-26 was probed with the "right" end of Jc8. Three 
10 of three + 3ergr unc-26 recombinants showed the Bristol 
pattern, while two of two ced-3 unc-26 ^Berg) 
recombinants showed the Bergerac pattern. Thus, nP33 
mapped very close or to the right side of unc-26. 

i* / *J h * " 1 ' ft " ° f JCS als ° detects a todI11 

15 (nP34) . The same Southern blot was reprobed with the 
Jca "left" end. Two of the two ced-3 unc-26 (+ Berg) 
recombinants and two of the three ced-3 (+ Berg) unc-26 
recombinants showed the Bergerac pattern. One of the 
three ced-3 ( + Berg) unc-26, recombinants showed the 
20 Bristol pattern. The genetic distance between ced-3 
and unc-26 is 0.2 mu. Thus, nP34 was mapped between 
ced-3 and unc-26, about 0.1 mu on the right side of 
ced-3 . 

The flanking sequence of the 5.1 Xb EcoRl Tcl 
25 fragment (named nP35) was used to probe the same set of 
recombinants. Two of three ced-3 (+Berg) unc-26 
recombinants and two of two ced-3 unc-2 6 (+Berg) 
recombinants showed the Bristol pattern. Thus nP35 
was also found to be located between ced-3 and' 'unc-26 
0 about 0.1 mu on the right side of ced-3. 

A similar analysis using cosmid T10H5 which 
contains the ffindlli rflp ( nP36) , and cosmid B0564 
which contains a tfindlli RFLP { nP37) , showed that nP3 6 
and nP37 mapped very close or to the right of unc -30 
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These experiments localized the ced-3 gene to an 
interval of three cosmids. The positions of the RFLPs, 
and of ced-3, unc-30 and unc-26 on chromosome IV, and 
their relationships to the cosmids are shown in Figure 
5 9. It was has been further demonstrated by 

microinjection that cosmids C37G8 and C33F2 carry the 
unc-30 gene (John Sulston, personnel communication) . 
Thus, the region containing the ced-3 gene was limited 
to an interval of two cosmids. These results are 
10 summarized in Figure 9. 

Complementation of ced-3 bv BwH M ^ n 

Cosmids that were candidates for containing the 
ced-3 gene were microinjected into a ced-3 mutant to 
see if they rescue the mutant phenotype. The procedure 
15 for microinjection was that of A. Fire (EMBO J. 5:2673- 
2680 (1986)) with modifications, unc-32, a mutant 
defective in locomotion, was used as a marker for • 
cotrans formation (Kim and Horvitz, Genes & Dev. 4:357- 
371 (1990)), because the phenotype of ced-3 can be 
examined only by using Nomarski optics. Cosmid C14G10 
(containing unc-32 (+) ) and a candidate cosmid were 
coinjected into ced-I (el375) ; unc-31(e928) ced-3(n717) 
hermaphrodites, and Fl non-Unc transf ormants were 
isolated to see if the non-Unc phenotype could be 
25 transmitted and established as a line of transf ormants . 
Young LI progeny of such transformants were examined 
for the presence of cell deaths using Nomarski optics - 
to see whether the ced-3 phenotype was suppressed. 
Cosmid C14G10 containing unc-32 alone does not rescue 
3 0 ced-3 activity when injected into a ced-3 mutant. 

Table 4 summarizes the results of these transformation 
experiments. 

As shown in Table 4, of the three cosmids injected 
(C43C9, W07H6 and C48D1) , only C48D1 rescued the ced-3 
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15 



phenotype (2/2 non-Unc transformants rescued the ced-3 
phenotype). one of the transfonnants, nSX2 , appears to 
be rescued by an extra-chromosomal array of injected 
cosmids (Way and Chalfie, Cell 54:5-16 (1988)), which 
is maintained as an unstable duplication, since only 
50% of the progeny of a non-Unc Ced(+) animal are 
non-Unc Ced(+). since the non-Unc Ced(+) phenotype of 
the other transf ormant (nisi) is transmitted to all of 
its progeny, it is presumably an integrated 
transformant. LI ced-1 animals contain an average of 
23 cell corpses in the head (Table 5). LI ced-1; ced-3 
animals contain an average of 0.3 cell corpses in the 
head.- ced-l; unc-31 ced-3; nisi and ced-1; unc-31 
ced-3; nEX2 animals contain an average of 16.4 and 14.5 
cell corpses in the head, respectively. From these 
results, it was concluded that G48D1 contains the ced-3 
gene. 

In order to locate ced-3 more precisely within the 
cosmid C48D1-, this cosmid was subcloned and the 
subclones were tested for the ability to rescue ced-3 
mutants (Table 5). C48D1 DNA was digested with 
restriction enzymes that cut rarely within the cosmid 
and the remaining cosmid was self-ligated to generate a 
subclone. Such subclones were then injected into a 
25 ced-3 mutant to look for complementation; young Ll 
non-Unc progeny of the transf ormants were examined 
using Nomarski optics for the presence of cell death in 
the head. When C48D1 was digested with BamHI and self-- 
ligated, the remaining 14 kb subclone (named C48D1-28) 
was found to rescue the ced-3 phenotype when injected 
into a ced-3 mutant (Figure 10 and Table 5) . C48D1-28 
was then partially digested with Belli and self- 
ligated. Clones of various lengths were isolated and 
tested. for their ability to rescue ced-3. 
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kb Jtt f C48D1 -"' which <"° not contain a l 7 

» J»1II fragment of C48M-28, was able to rescue ced-3 
(Figure 10 and Table 5, . C48 D i-43 was further 
subclones by digesting with Samj and 

into PBS KII + to generate P J40. pJ4 o can restore ceo--3 + 
phenotype when microinjected Into a ceo"-, mutant, p™ 
vas subcloned by deleting a 2 kb B? lH- ApaI fragnen * to 

10 ced-3 phenotype when microinjected into a ced-3 Mutant 

t ; n : f ^ M the le " of pno, could be 

made by E * o1II digestion (as in pjl07de!28 and 
P»107d.l3«, without meeting ced-3 Activity; in fact 
one transgenic line, nSX17 , restores full ced-3 
!5 activity. However, the ced-3 rescuing ability was 

;r f ; C ? y " dUCed Wh « n 1 *° — dieted on the left 

t»e abillT' ^ PJ1 ° 7de112 " d > J1 °'^"> . and 
the ability was completely eliminated when a !. 8 kb 

20 £107 (as in PJ55 and p J5S) , suggesting that this Sall 
sit. is likely to be in the ced-3 coding region. Fro* 
these experiments, ced-3 was localized to a DNA 
fragment of 7.5 kb. These results are summarized in 
Figure 10 and Table 5. 

25 T ranscrip t 

and defect T T & *° * Wot <* « «« 

and detected a band of 2.8 kb. Although this 

transcript is present in 12 ced-3 mutant animals 

subsequent analysis showed that all 12 cea . 3 autant 

= 0 alleles contain mutations in the genomic DNA that codes 

as"a^ TT <SSe bel ° W) ' thus «^ b "^ng this m 
as a ced-3 transcript. 

The developmental expression pattern of ced-3 was 
determined by hybridizing a Northern blot of RNA from 

4^ 
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ammals of different stages (eggs, LI through L4 larvae 
and young adult) with the ced-3 cDNA subclone pJlis. 
Such analysis revealed that the ced-3 transcript is 
most abundant during embryonic development, which is 
5 the period when most programmed cell deaths occur, but " 
it was also detected during the LI through L4 larval 
stages and is present in relatively high levels in 
young adults. This result suggests that ced-3 is not 
only expressed in cells undergoing programmed cell 
10 death. 

Since ced-3 and ced-4 are both required for 
programmed cell death in C. elegans, one of the genes 
m^ght act as a regulator of transcription of the other 
gene. To examine if ced-4 regulates the transcription 
15 of ced-3, RNA was prepared from eggs of ced-4 mutants 
(n!162, 211416, nlQ94, and 1X1920), and a Northern blot 
was probed with the ced-3 cDNA subclone pJlis The 
presence of RNA in each lane was confirmed with an : 
actin I probe. Such an experiment showed that the 
20 level of ced-3 transcript is normal in ced-4 mutants. 
This indicates that ced-4 is unlikely to be a 
transcriptional regulator of ced-3. 

Isolation of -a r~ e a-3 cnwft 

To isolate cDNA of ced-3, pJ40 was used as a probe - 
25 to screen a cDNA library of N2 (Kim and Horvitz, Genes 
& Dev. 4:357-371 (1990)). Seven cDNA clones were 
isolated. These cDNAs can be divided into two groups- ' 
one is 3.5 kb and the other 2.5 kb. One cDNA from each 
group was subcloned and analyzed further. pJ8 5 
30 contains the 3.5 kb cDNA. Experiments showed that pJ85 
contains a ced-3 cDNA fused to an unrelated cDNA; on 
Northern, blots of N2 RNA, the pJ85 insert hybridizes to 
two RNA transcripts, and on Southern blots of N2 DNA, 
P J85 hybridizes to more than one band than pj 40 (ced-3 
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genomic DNA) does. pJ87 contains the 2.5 kb cDNA. On 
Northern blots, pJ87 hybridizes to a 2.8 kb RNA and on 
Southern blots, it hybridizes only to bands to which 
pJ40 hybridizes. Thus, pJ8 7 contains only ced-3 cDNA. 
5 To show that pJ87 does contain the ced-3 cDNA, a 

frameshift mutation was made in the Sail site of pJ40 
corresponding to the Sail site in the pJ87 cONA. 
Constructs containing the frameshift mutation failed to 
rescue the ced-3 phenotype when microinjected into 
10 ced-3 mutant animals, suggesting that ced-3 activity 
has been eliminated. 

ced-3 Sequence 

The DNA sequence of pJ87 was determined (see 
Figure 4; Seq. ID #18). pJ87 contains an insert of 2.5 
15 kb which has an open reading frame of 503 amino acids 
(Figure 4; Seq. ID #19). The 5' end of the cDNA 
contains 25 bp of poly-A/T sequence, which is probably 
an artifact of cloning and is not present in the 
genomic sequence. The cDNA ends with a poly-A 
20 sequence, suggesting that it contains the complete 3' 
end of the transcript. l kb of pJ87 insert is 
untranslated 3' region and not all of it is essential 
for ced-3 expression, since genomic constructs with 
deletions of 38 0 bp of the 3' end can still rescue 
25 ced-3 mutants (pJ107 and its derivatives, see Figure 
10) . 

To confirm the DNA sequence obtained from the 
ced-3 cDNA and to study the structure of the ced-3 
gene, the genomic sequence of the ced-3 gene in the 
30 plasmid pJ107 was determined (Figure 4; Seq. ID #18). 
Comparison of the ced-3 genomic and cDNA sequences 
revealed that the ced-3 gene has seven introns that 
range in size from 54 bp to 1195 bp (Figure 5A) . The 
four largest introns, as well as sequences 5' of the 
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start codon, were found to contain repetitive elements. 
Five types of repetitive elements were found, some of 
which have been previously characterized in non-coding 
regions of other c. elegans genes such as fem-1 (Spence 
5 etal., cell 50:981-990 (1990)), lin-12 (J. Yochem, 
personal communication), and myoD (Krause et al. t Cell 
63:907-919 (1990)) (Figure 4). Of these, repeat 1 was 
also found in fem-i and myoD, repeat 3 in lin -12 and 
fem-1, repeat 4 in lin-12, and repeats 2 and 5 were 
10 novel repetitive elements. 

A combination of primer extension and PCR 
amplification was used to determine the location and 
nature of the 5' end of the ced-3 transcript. Two 
primers (Pexl and Pex2) were used for the primer 
15 extension reaction. The Pexl reaction yielded two 
major bands, whereas the Pex2 reaction gave one band. 
The Pex2 band corresponded in size to the smaller band 
from the Pexl reaction, and agreed in length with a 
possible transcript that is trans-spliced to a C. 
20 elegans splice leader (Bektesh, Genes & Dev., 2:1211- 
1283 (1988)) at a consensus splice acceptor at position 
2166 of the genomic sequence (Figure 4). The nature of 
the larger Pexl band is unclear. 

To confirm the existence of this trans-spliced 
25 message in wild-type worms, total C. elegans RNA was 
PCR amplified using the SLl-Log5 and SL2-Log5 primer 
pairs, followed by a reamplif ication using the SL1- 
OligolO and SL2-01igolO primer pairs. The SL1 reaction" 
yielded a fragment of the predicted length. The 
3 0 identity of this fragment was confirmed by sequencing. 
Thus, at least some, if not most, of the ced-3 
transcript is trans-spliced to SL1. Based on this 
result, the start codon of the ced-3 message was 
assigned to the methionine encoded at position 2232 of 
35 the genomic sequence (Figure 4) . 
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The DNA sequences of 12 EMS-induced ced-3 alleles 
were also determined (Figure 4 and Table 3). Nine of 
the 12 are missense mutations. Two of the 12 are 
nonsense mutations, which might prematurely terminate 
the translation of ced-3. These nonsense ced-3 mutants 
confirmed that the ced-3 gene is not essential for 
viability, one of the 12 mutations is an alteration of 
a conserved splicing acceptor G, and another has a 
change of a 70% conserved C at the splice site, which 
could also generate a stop codon even if the splicing 
is correct. Interestingly, these EMS-induced mutations 
are in either the N-terminal quarter or C-terminal -half 
of the protein, in fact, 9 of the 12 mutations occur 
within the region of ced-3 that encodes the last 100 
15 amino acids of the protein. Mutations are notably 

absent from the middle part of the ced-3 gene (Figure 
5) . 

Ced-3 Protein Contain* a Region p^ h in SPrinoc 
20 The Ced-3 protein is very hydrophilic and no 

significantly hydrophobic region can be found that 
might be a trans-membrane domain (Figure 6). The Ced-3 
protein is rich in serine. From amino acid 78 to amino 
acid 205 of the Ced-3 protein, 34 out of 127 amino 
25 acids are serine. Serine is often the target of 

serine/ threonine protein kinases (Edelman, Ann. Rev. 
Biochem. 56:567-613 (1987)). For example, protein 
kinase c can phosphorylate serines when they are 
flanked on their amino and carboxyl sides by basic 
30 residues (Edelman, 1987 supra). Four of the serines in 
the Ced-3 protein are flanked by arginines (Figure 4) . 
The same serine residues might also be the target of 
related Ser/Thr kinases. 

To identify the functionally important regions of 
3 5 the Ced-3 protein, genomic DNAs containing the ced-3 
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genes from two related nematode species, c. briggsae 
and C. vulgaris were cloned and sequenced (Figure 7; 
Seq. id /20 and 21). Sequence comparison of the three 
ced-3 genes showed that the non-serine-rich region of 
5 the proteins is highly conserved, in C. briggsae and 
C. vulgaris, many amino acids in the serine-rich region 
are dissimilar compared to the C. elegans Ced-3 protein 
(Figure 7} . it seems that what is important in the 
serine-rich region is the overall serine-rich feature 
10 rather than the exact amino acid sequence. 

This hypothesis is also supported by analysis of 
ced-3 mutations in C. elegans: none of the 12 EMS- 
induced mutations is in the serine-rich region, 
suggesting, that mutations in this region might'not 
15 affect the function of the Ced-3 protein and thus, 

could not be isolated in the screen for ced-3 mutants. 
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Tablo 1 
Rescue of the Cad-4 
Pheaoty pe pv Garni ina Trans f op mifr^rm 



Ganotype 



Ho. 

DMA Avg. No. Call Animals 

Injected Corpses (LI Head) Scorad 



ced-1; ced-4; 
unc-31; nExl 



C10D8; 
C14G10 



9.4 



10 



ced-1; ced-4; 
unc-31; nEx7 



C10D8-5 
C14G10 



11.5 



10 



ced-1; ced-4 
unc-31; nEx8 



C10D8-5 
C14G10 



11.5 



10 



ced-1 



None 



23 



20 



ced-1; ced-4 



None 
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Table 4 

Summary ef Tranaf o rmation Fhroarim«Tifc« 
Using Cosmida in *h a cerf-? » a<r '„» 



Coamid 
iaiectad 

C43C9; C14G10 

W07H6; C14G10 



C48D1;- C14G10 



No. of non-Unc 
tranaf ormanta 

■1 

3 



Cad-3 
phenotyp a 



Strain nrtmn 



MT4302 

MT4299 
MT4300 
MT4301 

MT4298 
MT4303 



Animals injected were of genotype: ced-i (el735) ; unc-3i(e929) 
ced-3(n7i7). 
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Genotype 



ced-2 
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The earereaaion of ced-3 f+) tranaf ormanta 



DNA injected 



Average Ho. 
cell deaths 
in LI head 



23 



Ho. 

Animals 
scored 



20 



ced-2; ced-3 



0.3 



10 



ced-1; nISl 
unc-31 ced-3 



C48D1; 
C14G10 



16.4 



20 



ced-1; unc-31 
ced-3; nISl/+ 



14.5 



20 



ced-1; unc-31 
ced-3; nEX2 



C48D1; 
C14G10 



13,2 
0 



10/14 
4/14 



ced-1; unc-31 
ced-3; nEXlO 



C48D1-28; 
C14G10 



12 
0 



9/10 

1 Of 10 



ced-1; unc-31 
ced-3; nEX9 



C48D1-28; 
C14G10 



12 



10 



ced-1; unc-31 
ced-3; nEXll 



C48D1-43 
C14G10 



16.7 



10/13 



Abnormal cell 3/13 
deaths 



ced-1; unc-31 
ced-3; nEX13 



pJ4 0; C14G10 



13 .75 



4/4 
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TaJala 5 continued 



csd-1; unc-31 pJ107del28, 23 12/14 

ced-3; nEX17 pJ107del34 

C14G10 

0 2/14 

ced-1; unc-3l pJ107del28, 12.8 9/10 
ced-3; nEXia pJ107dell34 

C14G10 

0 1/10 

ced-1; unc-31 pJ107del28, 10.6 5/6 

ced-3; nZX19 pJ107del34 

G14G10 

0 1/6 

ced-1; unc-31 pJ107dell2, 7.8 12/12 

ced-3; nEX16 pJ107del27 

C14G10 

Alleles of the genes used are ced-1 (el735) , unc-31 (e928) , and ced- 
3(n717). 



10 



15 
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20 



able tT" SJCUled in a " " ai - be 

able to ascertain using no more than routine 

experimentation, many equivalents to the specific 

embo^ents o f the invention described herein. Such 

equivalents are intended to be encompassed by the 

follows claims. Ko r example, functional equivalents 

through the degeneracy of the genetic code, encode the 
same proteins as those specifically claimed 

o^ n ml i °r 1 / <IUiValentS ° f Pr0tSlnS b * substituted 

or mod lfl ed amino acid sequences, wherein the 

substitution or modification does not change the 

act lvlt y or function of the protein. A "silent" amino 

acid substitution, such that a chemically simlL Z no 

acid (e.g., an acidic amino acid with another acidic 

f^cti"? 1S . SUbStituted ' - ■» "ample of how a 
functional equivalent of a protein can be produced! 

TsTlT 1 : 9UlV3lentS ° f " ucl "= "I-' =r proteins can 
also be produced by deletion o, nonessential sequences. 
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